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Preface 



This book deals with the proeesses, methods, and coneepts that lie behind modem 
models of combat. It is intended for readers with at least a scientific bachelor’s 
degree and some background in basic probability concepts. It includes three ap- 
pendices that address general quantitative methods that are applied throughout the 
book. Appendix A is a review of probability concepts, which are involved in al- 
most every chapter. A reader who has never studied probability before should con- 
sider reading this appendix, at least, before continuing. Appendices B and C are 
minimal introductions to the topics of optimization and Monte Carlo simulation. 

Use will be made throughout this book of Microsoft ExceF’^ workbooks de- 
veloped by the authors. Some of these workbooks include VBA (Visual Basic for 
Applications) code for certain functions or commands. The reader should obtain 
the zipped folder CombatModelingl .zip from the “downloads” link at the url 
http://faculty.nps.edu/awashbum/ . After obtaining the folder, unzip it and read 
Readme.txt and Erratal.txt. As long as proper credit for authorship is given, the 
workbooks may be freely distributed. An email link for author Washburn will also 
be found at that url or substitute “mkress” for “awashburn” to reach author Kress. 

Feedback will be appreciated. If you find an error, please point it out so we 
can incorporate it in the errata. If there are subsequent editions of this book, we 
will take advantage of your comments about organization and utility. 

We hope you find our book useful. 

Alan Washburn 

Operations Research Department 

Naval Postgraduate School 



Moshe Kress 

Operations Research Department 
Naval Postgraduate School 
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Chapter 1 

Generalities and Terminology 

I’m no model lady. A 
model ’s just an imitation of 
the real thing. 



Mae West 



1.1 Introduction 

A model is an abstraction of reality. An abstraction can take many forms: an archi- 
tect might construct a physical miniature model of the building he plans; a CEO of 
a corporation might use a diagram to present a new business idea, and a physicist 
might use a set of differential equations to represent some physical phenomenon. 
The need for models stems from the fact that the real world is too complicated for 
us to reason about and contains many details that are not necessarily relevant. Our 
limited intellects permit us to deal only with abstractions that retain the essence of 
the matter without the distracting details. The miniature model of the building, the 
diagram, and the set of differential equations are manifestations of these abstrac- 
tions and henceforth called simply models. Thus, models are entities of various 
types - physical, notional, or mathematical - that share the fact that they represent 
only an abstraction of a real object or situation. Models are used for reasoning, 
insight, planning, and prediction. They need to capture the key factors of the 
object or situation and faithfully represent them so that the models can be util- 
ized effectively. 

This book is about combat models, models that describe or represent weapon 
systems and combat situations. As mentioned above, there are several types. In 
order to organize our thinking on the matter, it is important to introduce terms by 
which one type of model can be distinguished from another, as well as terms about 
how such models are developed, used, and tested. The development of such a lexi- 
con is one object of this chapter (Section 1.2). Another object (Section 1.3) is to 
discuss certain modeling assumptions that are employed frequently enough to de- 
serve names. It is the authors’ contention that these “shortcut” assumptions are 
dangerous as well as useful and powerful and should therefore be discussed im- 
mediately in a book such as this. Section 1.4 summarizes the notational conven- 
tions that will be used throughout the book, and Section 1.5 is an overview of 
subsequent chapters. 
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1 Generalities and Terminology 



1.2 Classification of Combat Models 

Unfortunately, the business of combat modeling has long suffered from a lexicon 
that often obscures fundamental relationships and parallels. There have been sev- 
eral attempts to remedy this by authorities who announce glossaries of what terms 
should mean, but none of these have been completely successful. It is more or less 
like trying to impose standards on (say) French - the people will speak as they 
wish, despite the best efforts of the authorities. Terms such as “identity simula- 
tion” and “agent-based simulation” are regularly used in spite of being missing 
from such glossaries and seem to ease communication between those in the know. 
Still, one ought to make an effort to use terms that are in some sense standard. Our 
approach in this book will roughly follow the glossary defined by the United 
States Department of Defense (DoD, 1998). Our first, definitive use of terms will 
be italicized in the following. 

The real combat situation will inevitably differ from its model, particularly be- 
cause of the complexity and uncertainty associated with combat. Judging the ex- 
tent to which the model agrees with the real world is called validation, to be dis- 
tinguished from judging whether a particular implementation of a model (e.g., a 
computer program) is correct in the sense of being faithful to the model. The latter 
activity is called verification and has nothing to do with the real world, at least not 
directly. 

Models are either stochastic or deterministic. Intuitively, a stochastic model as- 
sumes uncertain or probabilistic inputs regarding an experiment or a situation and 
makes an indefinite prediction of the results. A deterministic model states exactly 
what will happen, as if there were no uncertainty. More formally, a stochastic 
model requires the terminology of the theory of probability (events, random vari- 
ables and probabilities, see Appendix A) for its description, whereas a determinis- 
tic model does not. The contrast is particularly important for Lanchester models 
(Chapter 5), where the same data can be employed by both types of model. Most 
combat models considered in this book are stochastic. 

The DoD glossary points out that the terms “modeling” and “simulation” are 
often used interchangeably, but nonetheless offers a separate definition of simula- 
tion. Our habit in this book will be to use the word simulation only to refer to a 
particular method for implementing stochastic models that generates a sequence of 
replications of an abstract experiment, making inferences from the sequence. The 
most important kind of simulation is Monte Carlo simulation, as described in Ap- 
pendix C. This type of model is one of seven shown in Fig. 1 . The three types to 
the right of it are also simulations, by our definition, since all face the problem of 
generating sufficient replications in an uncertain environment to enable reliable in- 
ference. The models differ in the way in which replications are generated and 
managed, whether computationally or manually. Contrary to simulations, analytic 
models do not rely on multiple replications, instead developing formulas that de- 
scribe results in a concise way. Analytic models may or may not be stochastic. 
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Figure 1: The combat modeling spectrum 



Figure 1 shows the “combat modeling spectrum” with an emphasis on the 
trade-off between abstraction and realism. The primary division is between mod- 
els that explicitly aim to find optimal decisions (prescriptive or normative models 
on the left side) and models that do not (descriptive or evaluative models on the 
right side). Prescriptive models aspire to prescribe optimal courses of action in 
complex combat situations. It is natural to expect that such models will require 
greater abstraction in order to find a useful and tractable implementation. Descrip- 
tive models describe combat phenomena and processes without prescribing 
courses of action, so they can be less abstract. The objective of a descriptive 
model is to gain insights about the main components of a system or the evolution 
of a combat situation, analyzing cause-and-effect relations and comparing alterna- 
tive courses of action. Sometimes, in rare situations where the uncertainty is lim- 
ited and well defined, and the model is supported with sufficient reliable data, the 
descriptive models may also be used for predictions of combat outcomes. Pre- 
scriptive models are analytic models with special ambitions. Optimization models 
deal with only a single decision maker, seeking a decision that is in some sense 
“optimal” (see Appendix B), whereas game theoretic models deal with multiple 
decision makers. Wargaming is a type of simulation that also involves multiple 
decision makers, using humans in an effort to achieve realism. Game theory and 
wargaming are introduced and contrasted in Chapter 7. At this point note only that 
these two methods for dealing with multiple decision makers are on opposite ends 
of the spectrum of abstraction. 

Among descriptive models, analytic models, even when stochastic, are innately 
verifiable in that separate implementations should result in identical results. If two 
such implementations differ, then at least one of them is wrong. Simulations (the 
four types to the right in Figure 1) are generally formulated because an analytic 
model is either not available or intractable. If a stochastic model involves the use 
of a random number generator, then the model is either a Monte Carlo simulation 
or something further to the right. Because of the randomly generated replications, 
two correct Monte Carlo implementations can achieve different results, so verification 
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1 Generalities and Terminology 



is a more difficult subject than it is with analytic models (statistical issues are 
involved, as noted above the arrow in Figure 1). If a model involves even a single 
decision made hy a flesh-and-blood human, then the implementation is either a 
man-in-the-loop simulation or something further to the right. If multiple humans 
with different goals make decisions, then the implementation is either a wargame 
or something further to the right. Finally, if any part of the implementation is directly 
affected by real-world phenomena, then the implementation is an exercise. Since 
exercises deal at least partially with the real world, some of their aspects are not 
controllable, and there may he irresolvable questions about what actually hap- 
pened when an exercise is complete. The further to the right in Figure 1, the more 
difficult the verification. We will have nothing further to say about exercises in 
this book, and very little to say about man-in-the-loop simulations. 

A given abstraction of a certain real-world situation can sometimes be imple- 
mented in multiple ways. Consider the real-world problem of flipping a fair coin 
three times, noting whether the event “two or more heads” happens. An analytical 
model would use the theory of probability (e.g., the Binomial distribution, see 
Appendix A) to conclude that the probability of the event is exactly 0.5. A Monte 
Carlo model of this situation is also possible where the three flips are simulated 
using random numbers, recording the fraction of the replications on which the 
event happens. The analytical model is surely best for this small problem, since 
the exact answer is derivable in a transparent manner with very little effort. In 
more complicated circumstances, the Monte Carlo model might be superior on the 
grounds of either efficiency or transparency. This particular question - whether an 
analytic or a Monte Carlo implementation should be pursued - is important and 
frequently occurring in studying stochastic models of combat. It will frequently be 
encountered in the sequel. The two methods are not incompatible - one useful 
verification tactic is to build both versions and compare them. 



1.3 Modeling Shortcuts 

As mentioned previously, most combat models considered in this book are sto- 
chastic. The inherent uncertainty of combat usually demands the type. Such mod- 
els can be very complex, and therefore, in order to be able to implement them, 
sometimes simplifying “shortcut” assumptions are imposed. While these assump- 
tions are powerful and facilitate efficient model implementation, they are also 
dangerous if not controlled and interpreted correctly. In this section we review 
some of the more commonly used shortcuts. 
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1.3.1 Expected Value Analysis 

Many combat models are multi-stage in the sense that stage 1 determines some 
important input (random variable X, say) to stage 2. X might be the number of at- 
tackers that survive an initial defense or the number of helicopters available for 
the second stage of a mission after surviving a sandstorm in the first stage. Let 
Y = f (X) be some scalar measure of the overall success of both stages and imag- 
ine that X is random. Since X is random so is Y. We would like to know the ex- 
pected successfulness of the two-stage operation or E{Y) (the expected value op- 
erator E 0 is introduced in Appendix A). The trouble is that finding E{Y) requires 
us to evaluate / (X) for all possibilities of X, and there may be many of them. If 
the function f(X) is difficult or expensive to evaluate, it will be tempting to 
evaluate only f{E{X)), the overall successfulness of the second stage given the 
average output of the first stage, instead of the more complicated if (/(A)) , the 
average successfulness of the two-stage operation. The systematic replacement of 
random variables by their expected values in this manner is called Expected Value 
Analysis (EVA), 

EVA is often harmless. We are told that “someone dies on a highway in the 
United States every 13 min.” That statement is not true. It would take a tremen- 
dous and pointless coordination effort to make it be true that highway deaths fol- 
low each other by exactly 13 min. What is true is that the average interval be- 
tween highway deaths is 13 min. The statement simply omits the word “average”, 
which is harmless as long as it is only used to estimate highway deaths over long 
intervals of time. However, EVA is not always harmless. Consider a system of n 
redundant controls designed to make sure that a nuclear attack does not start acci- 
dentally. Each control works independently with probability 0.9, and the system 
works as long as at least one control works. By EVA, we could argue that two 
such controls suffice. If X is the number of functional controls, then E(X) = 2(0.9) 
= 1.8, which is safe because 1.8 exceeds 1.0. There are two things wrong with this 
EVA analysis, one cosmetic and one fundamental. The cosmetic problem is that 
clearly the number of functional controls cannot be 1.8, since it is always an inte- 
ger. The fundamental problem in this example is that the EVA outcome is insuffi- 
cient to tell us the probability of an accidental nuclear attack, which is far too large 
when only two controls are employed (0.01, to be exact). As another example con- 
sider a tank crossing a minefield at night. There is a clear passage through the 
minefield but the disoriented driver of the tank may drive either 10 m right or left 
of the clear passage, with probability 0.5 either way. Using EVA, we might con- 
clude that the tank always drives exactly on the clear passage, since the average 
deviation is 0, and is therefore safe, while in reality it never drives on the clear 
passage and could very well get blown up. In addition to losing all track of vari- 
ability as shown in the first example, the deterministic “equivalent” model 
achieved by EVA can be wrong even on the average, as shown in the second. 
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If EVA is such a bad idea, then why do people do it? The primary answer is 
that EVA leads to deterministic models, and deterministic models are simpler than 
stochastic models, both conceptually and computationally. The point is sometimes 
made that combat models are inevitably only poor approximations to reality, so 
much so that approximating certain random variables by their average values is 
not worth worrying about. That is a good point, but there are some strong counter- 
arguments to it (Lucas, 2000). The introduction of EVA can turn an approximate 
but useful model into one that is actually misleading. 

Our point here is not that EVA is something always to be avoided, but only that 
the analyst should be aware of the dangers. The possible employment of EVA is 
one of the significant decisions that an analyst must make in formulating a sto- 
chastic model, so it is important to at least have a name for it. This subject is taken 
up again in Chapter 5, where the stochastic and deterministic versions of Lanches- 
ter’s equations are compared. 



1.3.2 Universal Independence 

Stochastic models generally deal with the interactions of a variety of different 
phenomena, and the statistical relationships between them are important to both 
model verity and model tractability. It is nearly always true that the most tractable 
assumption is that the phenomena are all independent of each other, so much so 
that the habit of making this assumption deserves a name: Universal Independence 
(UI). There are two basic reasons for UTs popularity: 

(1) It often is not a bad assumption about the real world. There are lots of phe- 
nomena that simply have nothing to do with one another in any practical sense, in 
spite of fables about the outcome of a battle depending on the flight of a butterfly 
thousands of miles away. 

(2) The theory of probability is usually at its simplest when dealing with inde- 
pendent events and random variables. The variance of a sum of random variables, 
for example, is equal to the sum of the individual variances when the random vari- 
ables are independent, but not otherwise. 

In spite of the first reason, there are circumstances where the assumption of in- 
dependence is dangerously bad. Eor example, let E and E be the events that tanks 
1 and 2 survive a transit of a minefield. Are E and E independent? They might be 
if the tanks travel independently, but they are certainly not independent if tank 2 
simply follows the path of tank 1 . If tank 1 survives, then its path must be safe, so 
tank 2 can also survive by following the same path. The validity of the independ- 
ence assumption thus depends on how tanks actually travel through minefields. 
Since one of the principal countermeasures to a minefield is for those forced to 
cross it to all follow the same path, a model based on UI is likely to be highly op- 
timistic about results from the viewpoint of the minefield planner. This observa- 
tion has been the cause of considerable turmoil in software for minefield planning 
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(see Chapter 8 for details). Another bad application of UI would be when several 
shots from a certain weapon share the problem that the weapon’s fire control sys- 
tem is biased, resulting in a common aiming error (see Chapter 2). 

Our point here is not that the assumption of independence is a bad idea; indeed, 
we will frequently employ it in the models developed in this book. It is instead the 
casual assumption of independence, as if the assumption were somehow a stan- 
dard that ought not to be disputed, that we wish to warn of. The popularity of UI 
stems from the tractability that it almost always lends to models of combat. Valid- 
ity, however, is something that ought to be carefully considered before employing 
it. 



1.3.3 Tuning Parameters 

Combat models sometimes incorporate parameters whose exact meaning is hard to 
divine, even with an English description. Possible descriptions might be “propen- 
sity to communicate,” “shrink factor,” or “surrender coefficient.” Such parameters 
are typically present because combat models must sometimes deal with aspects of 
the real world that cannot be ignored, even though they are poorly understood. 
These parameters are sometimes adjusted by experts so that model results are 
more or less valid; i.e., “tuned.” The problem is that model results can sometimes 
be quite sensitive to these parameters, which is dangerous because they represent 
poorly understood phenomena. The worst situation is when the model is being 
used to advocate some new tactic or combat system, since tuning parameters can 
be adjusted to make novelty appear advantageous. 

Extreme assumptions are often easier to model than intermediate assumptions, 
so a tuning parameter might be a bridge from one extreme to another that effects a 
realistic intermediate compromise. An example of this occurs in trying to predict 
the probability that an array of several sensors will detect a target, given detection 
probabilities for the individual sensors. We might apply UI and model all the de- 
tection events as independent, an assumption that is almost surely optimistic if the 
sensors are located physically near each other. An equally extreme assumption in 
the pessimistic direction would be that all sensors are completely dependent, in 
which case the whole array is equivalent to whichever single sensor has the largest 
detection probability. Having found an optimistic answer A and a pessimistic an- 
swer B, we might invent an “independence factor” /and let the array’s detection 
probability be ^(1 - /) -I- f/". As / moves from 0 to 1, the detection probability 
moves from optimistic to pessimistic. Ideally, / would be determined in experi- 
ments involving actual sensor arrays and actual targets or at least tuned by an ex- 
pert so that the predicted detection probabilities seem realistic. The tuning factor 
might temporarily achieve realism, but it also disguises our ignorance of how de- 
tections actually happen. Suppose someone invents a new sensor array whose 
elements are much closer together than the array for which the model was tuned. 
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We ought to be suspicious that the tuning factor for the new array should be 
smaller, but have no means of judging how much smaller, and might very well 
leave /unchanged because it is too much trouble to worry about. 

Necessity often dictates the use of tuning factors, but the reader should be alert 
to the dangers. Whenever you discover something called the “communications ad- 
justment factor” in a combat model, a skeptical investigation is appropriate. 



1.3.4 The Ostrich Effect 

Another alternative when encountering a poorly understood or controversial phe- 
nomenon in the course of formulating a combat model is to simply ignore it. Since 
the phenomenon is not even addressed, this tactic may forestall some of the skep- 
tical questions likely to arise when using a tuning factor. Ignoring the phenome- 
non, and thus not assigning any values to the parameters that represent it, may 
very well generate less controversy than assigning them questionable values. 

An example of this effect is presented in Morse and Kimball (1950). At one 
time merchant ships traveling across the Atlantic were given torpedo nets to en- 
tangle torpedoes launched by U-boats. The nets worked, but the analysts of the 
time concluded that they were a bad idea because they cost more than the ships 
and cargoes that they saved. In coming to this conclusion, the analysts ignored the 
fact that the nets also saved the lives of merchant seamen, effectively valuing 
those lives at zero. The analysts’ recommendations were taken (the torpedo net 
idea was abandoned), but imagine what might have happened if the lives had been val- 
ued at (say) $100 each. Any value on human life is controversial, so the whole process 
might have become hung up on debates about whether $100 is the right number. Surely 
$100 is closer to the value of a human life than $0, as the analysts effectively assumed, 
but assuming $0 forestalled the debate by not even bringing up the subject. 

It is sometimes not obvious whether a certain “detail” can be ignored or not. 
Suppose you are investigating how a diesel-electric submarine should evade a pur- 
suer while submerged. Such submarines are dependent entirely on the energy stored 
in their batteries, so it might be tempting to model the batteries as an energy supply 
E that can be expended at whatever rate the submarine chooses, as long as energy 
consumed never exceeds E. The problem is that the submarine’s best escape tactic 
might very well involve high speeds, therefore high power consumption, and there- 
fore high battery currents. Batteries can waste a significant fraction of their power 
on internal resistance whenever the withdrawn current is high, which leads to less 
energy available for avoidance. You therefore have two choices: 

• Persist with the original assumption, ignoring the effects of battery 
resistance. 

• Find out more about batteries. 
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The first choice corresponds to employing the ostrich effect to internal resistance, 
since you will never have to mention the subject. The second is likely to require a 
large investment of time, since it turns out that there is much more to learn about 
batteries, some of which is likely to be classified. You have reached a crucial point 
in the process of constructing a model of the situation. 

A model, being an abstraction, will always involve ignoring certain aspects of 
the real world. Many of these omissions are obviously harmless, but some are not. 
It is important to remember what has been omitted, and perhaps to test for signifi- 
cance after analysis is complete. In the battery example above, it may turn out that 
the submarine does not want to go fast for other reasons, even if its battery has no 
internal resistance, so the assumption that there is no resistance is harmless. It 
could also turn out that the details of how batteries work are essential if one wants 
to advise submarines on how to evade a pursuer. The latter possibility should be 
borne in mind, since the significance of an abstraction may not be clear until the 
model is used for something. The ostrich effect sometimes makes us forget that 
there ever was an issue, and that is the danger. 



1.3.5 Convenient Distributions 

There are a few probability distributions that are commonly used in combat mod- 
els because of their tractability and the elegant results they can produce. Among 
these distributions we find the Poisson, exponential, uniform, and normal distribu- 
tions (see Appendix A). Behind each such distribution hides several assumptions 
that may or may not be applicable to the system or situation being modeled. For 
example, the memoryless property of the exponential distribution is often analyti- 
cally convenient, to the point where it is sometimes assumed without justification. 
This habit is potentially dangerous. In some situations it is a reasonable approxi- 
mation to the actual behavior of a system, but in others (e.g., the “up” time of a 
machine that is subject to a mechanical wear) this may not be the case. 

Another example is the normal distribution that is widely used in firing theory 
(see Chapter 2). There is strong empirical evidence that justifies the use of this 
distribution in certain firing situations (e.g., artillery); however, in other firing 
situations (e.g., precision-guided missiles) this may not be the case. As with the 
other shortcuts mentioned above, caution must be exercised to make sure that the 
choice of probability distribution is appropriate for the situation. 
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1.4 Notation and Conventions 



The following notation is used throughout the book. 



^ means sum, ]”[ means product, J means integral; and — denotes a de- 
rivative with respect to x. Thus, 



means 



4 4 4 , 

i = 10, ]”[/ = 24, I xdx = 8, and — (x^ /2) = x . 




When no limits are specified and context makes clear what “everything” means. 



When no limiting statement is made, “for all” should be understood as “for all 
possible values”. 

max {2, 1, 7, 3} is 7, and more generally max^^j(aJ is the largest value of 
among all those with subscript y in the set S. Similar notation is used for the min 
operator. 

Scalars are presented in italic, vectors and matrices are bold. Thus x = (x,,x,,X 3 ) 
is a vector with three components. If the number of components is not important, a 
vector might also be abbreviated x = (x, ) . 

E{X) is the expected value of random variable X, and P{A) is the probability of 
the event A. For more detailed probabilistic notation, see Appendix A. 

Section 7.3.2 is the second subsection of the third section of Chapter 7, etc. 

Figures, tables, examples, and exercises are numbered sequentially in each chapter. 

References are accumulated at the end of the book, with referring sections or sub- 
sections given in [square brackets]. 




are “over everything. 



1.5 Book Overview 



11 



1.5 Book Overview 

The rest of this book comprises nine chapters and three appendices, each of which 
is previewed below. 



Chapter 2: Shooting without Feedback 

Fire is the main manifestation of military force and the major cause for combat at- 
trition. In this chapter we address the question of what happens when a weapon 
delivers rounds of fire on a target but gets no feedback regarding the effect. Issues 
of accuracy and damage are addressed in the cases of single and multiple shots, 
for single and multiple targets. While the majority of the chapter covers descrip- 
tive models, optimal shooting tactics are discussed too. 



Chapter 3: Shooting with Feedback 

This chapter deals with the situation where the shooter gets feedback about the 
status of the target (killed, partially damaged, unharmed) and possibly the miss 
distance. This feedback may be subject to error. In the presence of feedback, the 
question of optimal shoot-look-shoot (SLS) tactics arises. Both descriptive and 
prescriptive SLS models are presented. 



Chapter 4: Target Defense 

Consider a group of attackers approaching a target, hoping to kill it by overwhelming 
its defenses. The defense is armed with some anti-attacker weapons called intercep- 
tors, each of which can kill only the attacker to which it is assigned. The goal is to use 
the interceptors to maximize the survival probability of the target. Models, both de- 
scriptive and prescriptive, for the defense of single and multiple targets are presented. 



Chapter 5: Attrition Models 

The purpose of firing at the enemy is to cause attrition to his forces. In this chapter 
we present aggregate attrition models that apply to force-on-force situations. Other 
important ingredients in combat such as morale, endurance, maneuver, logistics, 
command and control, and intelligence are ignored or mentioned only in passing. 
The chapter focuses on Lanchester Models, both deterministic and stochastic, 
which are widely used in force-on-force modeling. 
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Chapter 6: Game Theory and Wargames 

Another major factor that affects the outcome of combat situations is the enemy’s 
behavior. The uncertainty associated with this behavior is different from the un- 
certainty that is associated, say, with the trajectory of a projectile. The projectile 
has no “feelings” or “opinions” about the combat outcome and its behavior is de- 
termined by Mother Nature, but the enemy has objectives and strong feelings 
about the outcome and behaves rationally. Thus, the uncertainty about enemy ac- 
tions deserves a different kind of treatment; decisions made against an enemy 
must differ qualitatively from decision made against Mother Nature. This chapter 
presents two types of models that address the case of a decision-making enemy: 
game theoretic models prescribe optimal strategies for both sides, while wargames 
are human-in-the-loop descriptive models (See Figure 1). 



Chapter 7: Search 

This chapter is about searching for physical objects with sensors that act very 
much like our eyes and ears. Similarly to our senses, the closer the sensor is to the 
object the higher is the probability of detecting it. Roughly speaking, search is a 
sequence of repeated trials where successes happen when the circumstances (e.g., 
distance, visibility, effectiveness of sensor) are such that the target is detected. 
This chapter reviews descriptive and prescriptive models for detecting the target 
as soon as possible. The models apply to both stationary and moving targets. 



Chapter 8: Mine Warfare 

A mine is a distinctive type of weapon; it is stationary once laid, its operation is 
triggered by the target, rather than the attacker, and it destroys itself in the process 
of attacking its target. Mines rely for their effectiveness on the enemy’s need to 
move, and they are most effective when operating as a group - a minefield. In this 
chapter we introduce simple minefield models and then proceed to discuss mine- 
field planning and minefield clearing. Game theoretic models are considered. 



Chapter 9: Unmanned Aerial Vehicles 

An unmanned aerial vehicle (UAV) is a remotely piloted or self-piloted aircraft 
that can carry a variety of payloads such as sensors, communication equipment, 
and even weapons. UAVs are increasingly becoming key weapon systems in modem 
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warfare. Typical missions of UAVs are surveillance, reconnaissance, target en- 
gagement and fire control for other long-range weapons, and attack. There are 
several problems associated with the design (size, velocity, payload, flying alti- 
tude, sensing capabilities) and operation of UAVs. This chapter only addresses 
operational issues. Specifically, two types of UAV problems are considered: (1) 
routing and scheduling UAVs in reconnaissance and search missions, (2) evaluat- 
ing the effectiveness of armed UAVs in attack missions. For the former, several 
optimization models are developed and demonstrated. For the latter, descriptive 
stochastic models are presented for evaluating the effect of such weapons in vari- 
ous engagement scenarios. 



Chapter 10: Terror and Insurgency 

Terrorism and insurgency are profoundly different from the combat situations dis- 
cussed thus far. First, they involve non-state actors that are organized and 
equipped differently than conventional forces and are not bound by any combat 
rules. Second, the general population, which only plays a passive role in conven- 
tional warfare, is much more involved in terrorist and insurgency actions both as 
victims (of terrorist acts) and active supporters or deniers (of an insurgency). This 
chapter presents three descriptive models. The first model evaluates the effect of 
suicide bombing in a crowded area. The second model describes the dynamics of a 
biological attack and the effect of various response strategies. The third model 
presents the effect of intelligence in counterinsurgency operations. 



Appendices 

Appendix A reviews basic probability concepts, probability distributions and 
Markov models. Appendix B introduces optimization models, and Appendix C 
discusses Monte Carlo simulations. 




Chapter 2 

Shooting Without Feedback 



One or two sarcastic spirits 
Pointed out to him, however. 

That it might be much more useful 
If he sometimes hit the target. 

From “Hiawatha Designs an Ex- 
periment”, a statistical poem by 
Maurice Kendall (1959) 



2.1 Introduction 

In combat, man specializes in exerting lethal force at a distance. The mechanism 
for exerting this force has progressed from stones to spears to firearms to rockets, 
hut the basic shooting problem for the marksman has always heen to effectively 
combine accuracy and lethality at long range. This chapter is devoted to abstract 
models of the shooting process. Such models have several possible purposes. One 
purpose is simply to determine the probability of killing the target for use in a 
higher level combat model. Another is to influence the design of a weapon system - 
this purpose will be aided by the dependence of kill probability on fundamental 
parameters that quantify accuracy and lethality. A third purpose is to influence the 
shooting process itself, since there are tactical decisions to be made about where to 
aim and how often to shoot. We assume throughout that the marksman gets no 
feedback between shots, so questions about how to adjust the aim point or when to 
stop shooting must be reserved until Chapter 3. 

The objective of the shooting process is to kill the target. The event “kill” may 
have several meanings such as “immobilize,” “damage,” “find,” or even “rescue,” 
but the only important thing is that the event should either happen to the target or 
not with each shot, with no residual effect if the target is not killed. 

Each shot is aimed at a certain aim point that the marksman controls, but the 
shot usually hits a different impact point because of dispersion errors that are as- 
sumed to be independent from shot to shot. The marksman may have a faulty no- 
tion of where the target is located, with the difference between the marksman’s es- 
timate and the actual target location being the target location error, one of several 
possible contributors to a bias error that affects all shots. We assume throughout 
that dispersion errors and bias errors are normally distributed in two dimensions. 
The world that we know has three dimensions, but most targets are known to lie 
on the surface of the Earth somewhere, which fixes the vertical dimension. 
Even problems that involve aerial or submarine targets have in most cases been 
reduced to two-dimensional problems by the invention of the proximity fuse. 
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2 Shooting Without Feedback 



While bivariate normality is not always the case in reality, it is usually close to the 
truth, except for outliers, and anyway the normal distribution should still be 
our default assumption because of the central limit theorem (see Appendix A). 

The plan of this chapter is to first deal with problems involving only a single 
shot, and then generalize to multi-shot problems. Section 2.2 deals with problems 
where there is only one shot. Section 2.3 generalizes to include many shots, as 
long as they are all independent. Section 2.4 deals with multi-shot problems where 
bias errors are significant and thus the shots are not independent. In such situations 
pattern firing is useful. 

It will be useful to have the Chapter2.xls ExceF^ workbook available. 



2.2 Single-Shot Kill Probability 



In this section we present some basic firing models that describe what happens 
when a single shot is fired. The shot will kill the target if, in spite of various errors 
in firing and in locating the target, the shot’s distance from the target is suffi- 
ciently small. Our object is to determine the probability of kill as a function of 
more basic parameters. 



2.2.1 Damage Functions and Lethal Area 

Definition: The miss distance r is the distance between the impact point of the 
shot and the location of the target. 

Definition: The damage function D{r) is the probability that the target is killed 
by the shot, as a function of the miss distance r. “Kill function” might be a more 
natural name, but our usage reflects common practice. Figure 1 shows three ex- 
amples of damage functions. 

The damage function has only one argument, so we are implicitly assuming a ra- 
dially symmetric situation where damage to the target is invariant to the position 
of the impact point, except for the miss distance. Damage functions are in practice 
measured through some combination of theory and experiment. In this book, we 
will take them to be given. 

The damage function is a conditional kill probability. The unconditional kill 
probability Pk is obtained by averaging over the miss distance. Let f(x,y) be the 
bivariate density function of the position of the weapon’s point of impact, relative 
to the target’s location. Then, since r = ^Jx^ + , 

Pk = f{x,y)dxdy , 



(2.1) 
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where the lack of limits means that the integral is to be taken over the whole 
plane. Sections 2.2.2 through 2.2.4 deal with various special cases of (2.1). 

If the position of the target were uniformly distributed within some large area 
A, then (2.1) would be (substituting /(v, y) = 1/d), 

Pk +>’^ ) dxdy , (2.2) 

A 

where the notation indicates that the integral is now taken only over the area A. 
However, since A is by assumption large, (2.2) is approximately the same as 
Pf, = a! A, where 

a = jj + y^) dxdy or (2.3) 

a = 27rjrD{r)dr . (2.4) 

0 

Formula (2.4) was obtained from (2.3) by introducing polar coordinates. The 
quantity “a” is the lethal area of the weapon and serves as a scalar measure of 
weapon effectiveness with respect to a certain target. 

Although it is not logically necessary, the damage function D{r) is typically 
a non-increasing function of its argument. As long as this is true, it is sometimes 
convenient to describe a damage function as follows: imagine that the weapon has 
a random “lethal radius” R associated with it, and that a target will be killed if and 
only if it lies within a distance R of the weapon’s point of impact. Recalling the 
meaning of D{r), it must evidently be the case that 

D{r)=P{R>r). (2.5) 

If D{r) is differentiable, one can go further and discover the probability density 
function off?: 

f,{r) = -^D{r). (2.6) 

dr 

The area covered by the weapon is kR^ , so it should come as no surprise that 
a = kE{R^) , where E{ ) denotes expectation. This provides another interpretation 
of the term “lethal area.” 



2.2.2 Cookie-Cutter Damage Function 

The conceptually simplest kind of weapon is one for which the lethal radius R is a 
constant, in which case the lethal area is a = nR^ . If the firing errors are circular 
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nomal (by which we mean that the standard deviation of the error in all directions 
is the same number <r) and centered on the target, then the two-dimensional den- 
sity function of the error is — ^-^exp(- ^ ) , and (2.1) reduces after the in- 

Ino 2(7 

troduction of polar coordinates to 

Pk = \ yexp(-^)r c/r c/6> = l-exp --R^/cr‘ . (2.7) 

This is a very simple expression for kill probability. A large number of military 
analyses have been based on it. Since (2.7) is a formula for the probability that the 
miss distance does not exceed R, it is also a formula for the cumulative distribu- 
tion function of the miss distance when firing errors are circular normal, a 
Rayleigh type of random variable. 




Dimensionless range 



alpha = 1 

alpha = 3 

alpha = 10 



Figure 1: The Diffuse Gaussiau (alpha=l) damage fuuctiou is cou- 
trasted with two more defiuite damage fuuctious with higher values of 
alpha. All three have the same lethal area. Dimeusiouless rauge is the 
ratio of miss distauce to the scale parameter h. 



Unfortunately, most departures from the circular normal assumption about er- 
rors result in significantly more complicated expressions for Pk- If the circular 
normal error distribution is offset from the target by some distance h, for example. 
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then evaluation of (2.1) involves the integral of a Bessel function that must be 
done numerically. One of the early products of the RAND corporation was a table 
of such probabilities (Marcum, 1950). Sheet “OffsetQ” of Chapter2.xls employs 
the function OffsetQ(BJ O, h/<7) to calculate the miss probability (the probability 
that the miss distance exceeds R) in this case. If the normal distribution is centered 
on the target, but not circular, then the function EllipQ{R, aj, oi) can be used in- 
stead. VBA code for both functions can be found in module 1 of Chapter2.xls. 
The code is based on Gilliland (1962), which also covers the general case where 
the normal distribution is neither centered nor circular. 

Example 1: If = 100 m and a = 50 m, the kill probability according to (2.7) is 
0.865. If the aim point is offset from the center of the distribution by = 25 m, 
then the miss probability is OffsetQ{2, 0.5) = 0.169, so the kill probability is only 
0.831. The bigger the offset, the smaller the kill probability. See sheet “OffsetQ” 
in Chapter2.xls for more calculations of this kind. 

Example 2: Suppose 7? = 100 m, but that the standard deviation of the error along 
the line between marksman and target (the downrange direction) is 90 m, 
while the standard deviation of the error in the direction perpendicular to that (the 
crossrange or deflection direction) is only 30 m. It is typical for downrange 
and crossrange errors to differ by about a factor of 3, as assumed here. The miss prob- 
ability is EllipQ{m, 90, 30) = EllipQim, 30, 90) = 0.293, so Pk= 0.707. An ap- 
proximation can be made by letting the error in both directions be whatever is re- 
quired to preserve the product of (30 m)(90 m) = (2700 m^) . That error is 

a = 51.96 m. Employing (2.7), we find that Pk= 0.843 and see that it can be sig- 
nificantly optimistic to assume that errors are circular when they are not. Sheet 
“EllipQ” of Chapter2.xls generalizes this example. 

Formula (2.7) is sometimes expressed in the form 

=l-(0.5)*"'^™‘’'’, (2.8) 

where CEP stands for “circular error probable.” 

Definition: Circular Error Probable (CEP) is the radius of the smallest circle 
that contains the two-dimensional error with probability 0.5. 

In (2.8), CEP references a circular normal firing error, but the definition of CEP 
applies to any kind of a two-dimensional error distribution. For a circular normal 
distribution, CEP is related to <Tby CEP = <7^2 In 2 = 1.1774<r(see Exercise 14). 

Dealing with CEP instead of a has the advantage of easing explanations to 
novices, since the idea of standard deviation is not involved. CEP can itself 
be confusing, however. The corresponding notion in one dimension would be an 
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interval that contains the error half the time, the half-length of which is sometimes 
called the linear error probable or LEP. A novice might expect LEP and CEP to be 
equal, since each corresponds to a region that contains half the shots, but they are 
not. A square with side 2 x LEP will contain only (0.5)^ of the shots, since both 
coordinates must fall within the relevant side of the square. A circle with radius 
LEP, since it falls within that square, will contain even less than 25% of the shots. 
CEP is thus considerably greater than LEP, and SEP (spherical error probable) is 
larger yet. In other words, the magnitude of the “error probable” must depend on the 
number of dimensions that one is interested in. It is simpler to say that the error has 
standard deviation a along any line, regardless of the number of dimensions. 



2.2.3 Diffuse Gaussian (DG) Damage Function 



The DG damage function has the form D(r) = exp 



2 ^ 






for some scale factor 



b. Unlike the cookie -cutter damage function, there is a positive probability of kill- 
ing the target no matter what the miss distance. The lethal area of such a weapon 
is Figure 1 compares D{r) for a DG damage function with two other damage 
functions that will be discussed in the next section. The weapon with the DG func- 
tion is evidently “sloppier” than the others, in the sense that results at any given 
range are harder to predict. Whether this feature makes the DG assumption more 
realistic depends on the damage mechanism. Weapons that kill by fragmentation 
(e.g. artillery using “fragmentation” rounds) generally have a sloppier damage 
function than those that kill by overpressure (e.g., artillery using high explosive 
rounds). The cookie-cutter damage function is the least sloppy of them all because 
it suddenly falls from 1 to 0 as the miss distance increases. 

The DG assumption combines very nicely with the assumption of normal er- 
rors to produce a simple, general expression for Pf^. If the center of the error distri- 
bution is (jilx, Py), and if the standard deviations of the X and Y errors are (oy, oy), then 
(2. 1) can be evaluated analytically. In fact, (2.1) can be integrated in closed form even 
when the damage function is not radially symmetric, so we record here the result for 

r r , 



the asymmetric DG damage function: IfD(x, y) = exp 



Pk = 



h^by 



'i{bl+(yl){bl+(yl) 



exp 



pI 



i4 



V 






r 

y_ 



bi +(7l 






then 



(2.9) 



In the special case where jJix = bx= bY= b, and oy = ay = o; (2.9) reduces to 

b^ 



P.= 



b^ + a^ 



(2.10) 
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which is comparable to (2.7). 

There is no cookie-cutter counterpart to (2.9); that is, there is no simple ana- 
lytic expression for the cookie-cutter kill probability with the generality of (2.9). 
While it is true that the cookie-cutter damage function is conceptually simpler than 
the DG damage function, it is equally true that the DG function is analytically 
simpler than the cookie-cutter function. The simplicity of the DG assumption was 
first taken advantage of in 1941 by von Neumann (Taub, 1962), who used it in de- 
termining optimal bomb spacing in World War II. 

The “DGGenrl” sheet of Chapter2.xls calculates Pk from (2.9), as well as per- 
forming a Monte Carlo simulation whose object is to estimate the same quantity. 
The agreement of the two answers is part of the process of verifying that each is 
correct. 



2.2.4 Other Damage Functions 



It was pointed out in Section 2.1 that any non-increasing damage function can be 
interpreted as the probability law for a random lethal radius R. The Diffuse Gaus- 
sian damage function, for example, has associated with it the density function 

r 2 ^ 



A('') = ^exp 



2^ ’ ^ Rayleigh density function. It is perhaps more 



natural to deal with the random variable Rr, since R^ is directly related to area 
covered. For the DG damage function, is an exponential random variable with 
mean 2b^. 

It is possible, of course, to reverse the process: begin with some convenient 
density for R or and then discover the associated damage function by integra- 
tion. One convenient class of damage functions (the Gamma class) can be ob- 



tained by assuming that —R?lb^ has the Gamma density 

2 T{a) 



-exp(-ax) for 



some a> 0, in which case the DG damage function is the special case 1 and 
the cookie -cutter damage function is obtained in the limit as a — » Every mem- 

ber of the class has the same lethal area 2;rb^. The associated damage function is 

D„(r) = l-r[a,^j (2.11) 



where T{a,x) is the incomplete Gamma function (in Excel™, T{a,x) is 
GAMMADIST(x, a, 1, TRUE) — see Figure 1 or sheet “Gamma” of Chapter2.xls 



for plots of (r) versus 
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The Gamma class is convenient because it has both scaling (h) and shaping 
(a) parameters, and also because there is a simple formula for Pk when the firing 
error is circular normal with standard deviation crand centered on the target: 




«> 0 . 



(2.12) 



Formula (2.10) is the special case where 1, and (2.7) is the limiting case as 
» oo. Except in the case a = 1, where (2.9) applies, the centered circular normal 
assumptions are essential for having a simple expression for 

Another class of density functions for with both a shaping and a scaling 
parameter is the class of lognormal densities. There turns out to be little to rec- 
ommend this class in terms of analytic convenience - there are no counterparts to 
(2.11) and (2.12), for example. Nonetheless, the class has been widely used to 
model the effects of nuclear weapons (DIA, 1974). 



2.3 Multiple-Shot Kill Probability 



In this section we consider multiple shots made in a salvo. 

Definition: Regardless of the number of targets, a “salvo” is a group of shots all 
taken on the basis of the same information, with no information feedback between 
shots. We often imagine that the shots are all made at the same time, even though 
that is not necessarily the case. 



2.3.1 Simultaneous Independent Shots 

Suppose that a salvo of n independent shots is fired at a target, and let be the 
probability that the ith shot fails to kill it. The numbers qi may be obtained from 
one of the formulas in Section 2.2 or by some other method. Since all shots are by 
assumption independent, the probability that all n miss the target is the product of 
the miss probabilities, so 



PK^l-qiqi...qn- (2.13) 

In the case where all the shots have the same miss probability q, this reduces to 
= I — q" , a formula that is used so often that it has a name “powering up.” If 
each of three independent shots has a kill probability of 0.3, one might naively ex- 
pect the kill probability of the collection to be 0.9. The correct answer, however, is 
1-0.7^ = 0.657. The miss probability should be powered up, rather than making 
the kill probability proportional to the number of shots. 
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Formula (2.13) takes on a particularly simple form if the shots are all of the 
cookie-cutter type, and the firing errors are circular normal centered on the target. 
Let Rj and o; be the lethal radius and error standard deviation of the shot. Then 



= exp 




from (2.7), and therefore 



Pjt = 1 - exp(-X/2), where (2-14) 

X = Rflaf+-+Rl/al. 

The quantity X can be thought of as a measure of the effectiveness of an arse- 
nal of weapons against a particular target. The target dependence of this effective- 
ness can be eliminated if lethal radius scales in a known manner with the energy 
yield Y of the weapons. If the kill mechanism is overpressure, for example, then 
R. = where K is a target-dependent constant, and therefore 

X = K^[Y,^/^/af+... + Y„^P/al]. The quantity in [ ] is a target-independent 

measure of effectiveness for the group of weapons taken as a whole. It differs 
from “counter military potential” (CMP, see below) only in the scale factor re- 
quired to convert standard deviation to circular error probable (CEP) for circular 
normal weapons (see Section 2.2.2). 

The CMP of a group of n weapons is 

CMP = Y,^'^ /cep' + • • -I- F/''' /cep/ , (2.15) 

where yield is measured in equivalent kilotons of TNT, and CEP is measured in 
nautical miles. CMP is one of several measures that have been used to compare 
arsenals of nuclear weapons. This measure is very sensitive to accuracy - dou- 
bling all yields increases CMP by the factor 2'^^ = 1.6, whereas halving all CEP’s 
increases CMP by the larger factor 2' = 4. In the 1970s, this fact was sometimes 
used to make the point that the relatively small but accurate nuclear arsenal of the 
United States was actually more potent than the large but inaccurate arsenal of the 
Soviet Union. Tsipis (1974), for example, estimated that CMP was 22000 for the 
US and 4000 for the SU in 1974. 

An alternative measure of effectiveness for an arsenal is “equivalent mega- 
tons” (EMT). The definition of EMT is 

EMT = T;'/'-i---i-T/'''. (2.16) 



Since Y^'^ is proportional to 7?/ , EMT is essentially a target-independent measure 
of the total lethal area of the arsenal. The “EMTCMP” sheet of Chapter2.xls com- 
pares the 1978 ICBM arsenals of the USA and the USSR using both measures. 

If a total of C units of CMP are applied to a target, then the kill probability is 
of course still a function of the hardness of the target. For nuclear weapons making 
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overpressure kills, with hardness h being measured in pounds per square inch 
(psi), an approximate formula (Matlin, 1972) valid for 30<h< 1000 psi is 

Pk=1- exp(- 0.0583Cr“'^). (2.17) 

For example, a 1000 kiloton weapon with a CEP of 0.25 nautical miles will kill a 
1000 psi target with probability 1 - exp(- (0.0583)(400)(0.00794)) = 0.52. Sixteen 
such weapons would be equally effective if the CEP were 1 nautical mile. 

Figure 2 shows an application of the EMT idea to various historical ships, ex- 
cept that EMT is replaced by E8RPM. E8RPM is the equivalent rate of applying 
8" rounds per minute, plotted against firing range for each of five ships. Since 
yield (Y) is proportional to weight, and weight to the cube of dimension, Y is 
proportional to dimension squared. Thus one 16" round is roughly equivalent to 
four 8" rounds, etc. Reducing all firepower to a common scale permits each ship 
to have a single “weight of broadside” versus range curve, independent of the tar- 
get. Aircraft carriers can be portrayed on the same scale by calculating the rate at 
which aircraft can deliver 500-pound bombs, each of which is equivalent to two 8" 
rounds. At a range of 20 nm, any of the battleships shown in Figure 2 could have 
quickly sunk the USS Enterprise. The battleship’s problem in World War II was 
that an aircraft carrier could deliver enough firepower, even at long range, to put a 
battleship out of commission before any of its guns could come into play. 
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Figure 2: Equivalent 8" rounds per minute for five historical ships. 
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Figure 2 does not tell the whole story, of course, since no information about 
accuracy or armor is given. Figure 2 does make it clear that the USS Enterprise 
and the USS New Jersey were very different weapon systems, and that the passage 
of 35 years produced an aircraft carrier much more powerful than the Enterprise. 



2.3.2 Salvos of Dependent Shots 



The firing errors dealt with in the previous section were dispersion errors, 
whereas in this section we assume the additional presence of a bias error. 

Definition: Dispersion errors are firing errors that are independent and identi- 
cally distributed among multiple shots. Such errors are sometimes also called bal- 
listic errors. 

Definition: Bias errors are errors common to all shots. Such errors are sometimes 
also called systematic errors or aiming errors. 

Bias error might be due to a misalignment between the aiming and launch- 
ing systems, to an error in target location, or to any other effect that introduces 
an error component common to all shots. The result is frequently that the impact 
points relative to the target are tightly grouped (indicating small dispersion er- 
rors) but in the wrong place due to bias, as in Figure 3. One can think of the bias 
error as being the center of gravity of the group, and of the dispersion errors as 
being deviations from the center of gravity. We shall use the notation that 
(<T( 7 , <Tk) are the (horizontal and vertical, say) standard deviations of the bias er- 
ror, whereas the independent dispersion error for each shot has standard devia- 
tions {ax, ay). 
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Figure 3: Illustrating a patteru with small dispersiou aud large bias. This 
marksmau might be more effective if he had a larger (!) dispersiou. 

It is no longer possible to proeeed by first finding the single-shot kill prob- 
ability and then invoking (2.13) to obtain a simple expression for Pk, sinee the re- 
quired independenee assumption is falsified by the bias error. The following ex- 
ample should make this elear. 

Example 3: Suppose there are two shots, and eonsider a one-dimensional prob- 
lem where the bias error is X and the dispersion errors are Y\ and Y 2 for shots 1 
and 2. To keep the eomputations simple, assume that all errors are either —1, 0 
or -1-1, rather than being normally distributed, and that the target will be killed if 
and only if the sum of the errors is 0 for some shot. Thus, the probability of kill- 
ing the target with two shots is (2) = P{X -b 1^ = 0 ox X + Y^=Q) . Assume that 
X is equally likely to be any of the three possibilities, whereas Y\ and Y 2 are in- 
dependent random variables with probabilities 1/6, 2/3, and 1/6 of being -1, 0, 
or -bl. Using the theorem of total probability, the single-shot kill probability is 
P^(l) = P(JT-bl^ =0) = (l/3)(l/6)-b(l/3)(2/3)-b(l/3)(l/6) = l/3. Eaeh of the 
two shots will kill the target with probability 1/3. If we were to power up the 
miss probability, the kill probability for two shots would be 
Pk{2) = 1 - (1 - (1))^ = 5 / 9 . However, powering up is not justified in this case 

because the two shots are not independent - they share the common error X. 
The correct answer can be obtained by conditioning on the value of X\ that is, by 
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powering up for each of the three possible values of X, and then averaging: 
i\,(2) = (l/3)(l-(5/6)") + (l/3)(l-(l/3)") + (l/3)(l-(5/6)") = l/2. It is typical 
for the faulty computation to produce a kill probability that is too high. 

In general, let (U,V) be the two-dimensional bias error, and let f{u,v) be its 
density function. If there are n shots and Q (u, v) is the miss probability of the ith 
shot, given that the bias error is (u,v), the expression for Pk is 

=E{\-Y[QXU,V)) = \\{l-f[QXu,v))f{u,v)dudv (2.18) 

/=] /=! 

We will have little direct application for (2.18), the main reason being the compli- 
cated nature of the functions Q (m,v) , which in general will depend on the aim 
points for each of the n shots. In fact, we will find no simple exact expressions for 
Pk in this section. The best that can be hoped for, other than solutions to specific 
problems that are important enough to justify the work involved in evaluating 
(2.18) for all aiming patterns of interest, is some rules of thumb that take the form 
of approximations. In deriving these approximations, it will be convenient to 
imagine that the only source of bias is an error in target location. The approxima- 
tions are actually valid regardless of the source of bias or even if there are several 
sources (see Section 2.3.4). 

Our first approximation to Pk is an upper bound obtained by making two un- 
realistic assumptions that are clearly favorable to the marksman. One assumption 
is that there are no dispersion errors, and the other assumption is that the marks- 
man can exchange his weapons for any other weapon or weapons with the same 
total lethal area. The second assumption permits the marksman to avoid all of the 
problems associated with trying to fill up space with circles, since he can reshape 
the lethal area in any convenient manner. In the circularly symmetric case where 
(7u= <Tv~ <y, the marksman would prefer to have a single large cookie-cutter 
megaweapon that he would aim directly at the target or more precisely at the mean 
location of the target. If the total lethal area of n weapons is na, then the lethal ra- 
dius of such a megaweapon would he R = -^na/n , and the resulting kill probabil- 
ity would be (from (2.7)) l-exp(-7?‘/2cr*) = 1 - exp (- «fl/(2;rcr* )). More gen- 
erally, the best megaweapon for our privileged marksman is a cookie-cutter with 
the same elliptical shape as the iso-probability contours of the error distribution. 
The resulting bound is 
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Pjc < 1 - exp(-z), where z = . (2.19) 

'2.7Z(7jj (7 Y 

Formula (2.19) was obtained by essentially assuming away all the overlap 
that is caused by dispersion errors, circle -packing problems, and noncookie-cutter 
weapons. The expression 1 - exp(-z) should therefore be expected to be an accu- 
rate approximation in circumstances where overlap is expected to be a minor prob- 
lem. Seven circles, for example, pack rather nicely into one circle without very 
much overlap. Figure 4, which is taken from the “Patterns” sheet of Chapter2.xls, 
shows the upper bound lying well above two other approximations that are intro- 
duced below. 




UpBound 

SOLR 

SULR 



Figure 4: The two locally random approximations, SULR and SOLR, lie well 
below the upper bound on kill probability. 

A different kind of approximation is based on the idea that overlap is inevita- 
ble, and that one should expect the amount of overlap to be whatever happens “at 
random.” More precisely, the total lethal area na is assumed to be in effect so 
much confetti, with the marksman being able to control the density of confetti on a 
large “strategic” scale, but not the small-scale tendency of the flakes to overlap 
one another. Now, if d square inches of confetti are scattered uniformly at random 
on a one-inch square or in other words if the coverage ratio is d, then the fraction 
of the square that is covered is 1— exp(- d), as long as the flakes are sufficiently 
small. This expression is most easily derived by imagining that there are k inde- 
pendently located flakes of confetti with total area d. The probability that each 
flake covers the target is d!k, the fraction of the square covered by the flake. By 
powering up, the probability that some flake covers the target is \—{\—dlkf. The 
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limit of this expression as k approaches infinity is 1— exp(-<i). This is trae even if 
the density of confetti {d) depends on location. The marksman’s problem is then to 
distribute a fixed amount of confetti {no) over the plane in such a manner that the 
(unconditional) kill probability is maximized. 

Assume that <T(/ = (Tv= <7, and that the marksman scatters all na units of con- 
fetti uniformly over a circle with radius r in the hope that some flake covers the 
target. This is the strategically uniform, locally random (SULR) case. Within the 
circle, the coverage ratio is d = na!{n:r^) . The probability that the target is actu- 
ally in the circle is (from (2.7))l-exp(-r^ / 2 <t^) , so the unconditional kill prob- 
ability is 

p{r) = [l-exp(-r^/2cr^)] [l-exp(-na/;rr^)] . (2.20) 

The first factor in (2.20) is 0 if r = 0, while the second is 0 as r — > °o, so there must 
be a maximizing value for r. The value turns out to be r = o(4z)*^"', where 
z = nal{2K(^), as can be verified by showing that {d/dr)p{r ) = 0. Upon substitut- 
ing r into (2.20), one obtains the strategically uniform locally random (SULR) 
formula 

Pk = P(f*) = (l-exp(-Vz))" . (2.21) 

Formula (2.21) also holds when au^ (Tv, provided that z = «a/(2;r(7j^(7^) and that 
the confetti is scattered uniformly over an optimally sized ellipse. Figure 4 shows 
that the SULR formula (2.21) provides a much smaller estimate of Pk than does 
(2.19). 

The final approximation is the same confetti approximation, except that the 
coverage ratio is allowed to be any function d{x, y) of two spatial coordinates, sub- 
ject of course to being nonnegative and to the constraint that the total amount of 
confetti used must be na. This includes the case where d{x, y) is constant within 
some region and 0 outside it, so we should expect the current approximation to be 
larger than (2.21). This is the strategically optimal, locally random (SOLR) case. 
Formally, the optimization problem is 

maximize l|/( T,y )[> -exp(-(i {x,y))^dxdy 

subject io d{x,y)>Q for all x, y (2.22) 

and II d {x,y) dx dy = na, 



where / (x, y) is the bivariate normal density function with standard deviations 
((7u, (Ty). The solution can be found in Morse and Kimball (1950, Chapter 5), to- 
gether with a discussion of how the optimal coverage ratio function d\x,y) can 
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be used as a guide in designing effective patterns. The optimal function d"" {x,y) 
is 



d* {x,y) 








(2.23) 



where the + indicates that d {x, y) is to be 0 rather than negative, and where as 
usual z = na l{2na^ay) . Note that the confetti should be most dense at the origin, 
with the density falling off gradually to 0 on the (8z)’ -standard deviation ellipse. 
Outside of this ellipse there should be no confetti at all. The result of substituting 
d (x,y) into the objective function is the SOLR formula 



Pj. =l-(l-i-%/^)exp (-V^) , (2.24) 

The SOLR formula is also shown in Figure 4. There is not much difference be- 
tween (2.21) and (2.24). Once the total lethal area has been conceptually reduced 
to confetti, it turns out not to be crucial that its distribution be exactly (2.23). 

Example 4: Suppose that there are four weapons with cookie-cutter damage func- 
tions, with R = 7.5, and that the error standard deviations are aj/= (7^= 7.5, = 

(Ty = 1 . By exhaustive trial and error computations, it can be determined that the 
exact best pattern is a square of side 1 1 .7, and that the associated kill probability is 
0.80. Sincez = 4.^(7.5)^ /(2.^(7.5)^) = 2 , the three approximations are 0.865, 
0.594 (SOLR), and 0.573 (SULR). The upper bound is considerably closer to the 
tmth than either of the confetti approximations. The confetti approximations can 
be made to look better by letting the weapons be diffuse Gaussian with the same 
lethal area, in which case the approximations do not change but exact computa- 
tions reveal that the best is only 0.69, achieved by aiming the four weapons in a 
square of side 10. If the dispersion error is in addition increased from 1 to 5, the 
approximations still do not change, but the best possible Pk decreases to 0.62. 

Since neither oy, (Ty, nor any feature of the damage function other than lethal 
area enters the computation of z, it is clear that one could find cases where the ac- 
tual kill probability is even smaller than the confetti approximations. In fact, one 
has only to consider any problem where the shots are nearly independent, since 
z = CO when Cu or <Ty is 0. In problems where the bias errors dominate the disper- 
sion errors, however, the confetti approximations can usually be thought of as 
lower bounds on P^. 

There is one other bias-compensation technique that is worthy of mention. In- 
stead of aiming the several shots in a pattern, one can simply aim them all at the 
target, but inaccurately. A shotgun blast is an example of this; the marksman aims 
at only one place, but the shotgun pellets form a random pattern around it. Sheet 
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“SULR SOLR” of Chapter2.xls demonstrates this for the SULR and SOLR ap- 
proximations by generating the shot locations randomly from the appropriate den- 
sity, and then testing whether any of the shots actually kills the randomly located 
target. This is a Monte Carlo simulation (Appendix C). If the deliberate inaccuracy 
itself takes the form of a bivariate normal error for each shot, the technique goes 
by the name of “artificial dispersion” and is amenable to analysis because only the 
sum of artificial and real dispersions is important. Significant work on determining 
the right amount of artificial dispersion was performed during World War II in 
Russia (Kolmogorov, 1948). The overall effect should be similar to one of the 
confetti approximations. 

Given all the above considerations, we offer the following procedure for ob- 
taining an approximate Pk in the general case where both bias and dispersion are 
present: 

(a) If dispersion dominates bias, determine the “equivalent” dispersion stan- 
dard deviations cr)( = and a'y = ^JcTy + for each shot, and 

then use (2.13) to obtain an approximate Pk- 

(b) If bias dominates dispersion, and if the “packing problem” can probably 
be solved without much overlap (nearly cookie-cutter weapons, disper- 
sion small compared to lethal radius as well as bias, etc.), use (2.19). This 
is an upper bound. 

(c) If bias dominates dispersion, and if it is clear that the best pattern will in- 
volve substantial overlap, use one of the confetti approximations. 

The above rules are not exhaustive, since there are certainly cases where nei- 
ther type of error dominates the other, and in any case the resulting estimate of Pk 
is only an approximation. An accurate Pk can only be obtained by evaluating (by 
Monte Carlo simulation, for example - see Appendix C and Exercise 8) suffi- 
ciently many patterns to be sure of having discovered the best one. 



2.3.3 The Diffuse Gaussian Special Case 

The case of the DG damage function with normally distributed errors is excep- 
tional in that Pk can be evaluated analytically for any given pattern of shots, even 
when the shots have different parameters. As a result, it is possible to evaluate and 
even optimize a given pattern using a spreadsheet and some VBA code. This is an 
important enough capability that we include the required mathematics in this sec- 
tion, but the reader may wish to skip it in favor of sheet “DGPattn” of Chap- 
ter!. xls, which employs the mathematics in the form of VBA code, or to refer to 
the expanded and slightly generalized treatment in Washburn (2003a). 

Formula (2.9) is a product of two factors, one depending on X-parameters and 
the other on Y -parameters. Let 
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K(h,ju,a) 






exp 



1 u" 

2b^+a- 



(2.25) 



Then (2.9) states that = K(b^,ju^,aj^)K{by,jUy,ay) . Now, let n shots be 
aimed at (x„ j,), If (U, V) is the bias error common to all shots, then the 

ith shot is offset from the target by {jJL^,lJly) = (x, +C/,y, +V ) , and the kill prob- 
ability for the ;th shot is therefore {U, V) = K{b^ , x. + U, a^y )K (by^ , y. + V, ). 

Here we have changed the name of the product because we wish to reserve the 
symbol P^ for the kill probability of the whole collection of n shots. Given the 
bias error, the shots are independent and we can employ (2.13), so 



\-P^=E{Y[{1-P>{U,V)}). (2.26) 

;=1 

In (2.26), which is an application of the theorem of total probability, the ex- 
pected value is with respect to the normal distribution of the common bias error 
([/, V). Note that each shot can potentially have distinct parameters for dispersion 
and lethality, with bxi and oy, being the lethality and dispersion of the ith shot in 
the X-direction, etc. 

The first step in evaluating (2.26) is to expand the product into a sum of 
terms, each of which is associated with one of the 2" subsets of the n shots. For the 
subset S, the term is 

ts^EiY[PXU,V)). (2.27) 

ieS 



The plan is to analytically evaluate each of these terms, and then sum up all 2" 
of them, paying proper attention to signs. Our first observation is that, since U 
and V are independent errors, and since Pj{U,V) can be factored into an X-part and 
a T-part, so can ts. In fact, let 

t{b,\l,o,s,S)^E{Y{K{b„^l,+W,a,)) , (2.28) 

i^S 



where b, ct, and p, are «-vectors indexed by shot number, and the expectation is 
taken with respect to a normal distribution of W that has standard deviation s. 
Then = t{\) ,y,ayj,S)t{\)y,'s.,(Sy,ay,S) . Here ={b^^,...,b^ ) , and simi- 
larly Ox, X, by, Oy, and y are also «-vectors. The central problem is now to evaluate 
t(b, p, a, s, S) , since this will lead to the evaluation of Pk through (2.26). 
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Let M, = Kb^ +0-f ); A: = 0, 1, 2 , let C = ]J 



definition ofiT() in (2.25). The product in (2.28) is 



3/^ 



2 , ^2 
+ ( 7 , ^ 



and recall the 



Yl K{b, ,lil, -I- If , (7,- ) = C exp I - - (If -b 2WM^ + M, ) 



(2.29) 



The expectation of (2.29) with respect to W can be accomplished analytically 
because (2.29) involves a quadratic expression in W lying within an exponential. 
The result is 



t(b,p,n,5,5) 



C 






rexp 







(2.30) 



Calculating Pk is now just a matter of going through the steps outlined earlier. 
One can even include a shot reliability by modifying the C-factor. An implementa- 
tion of the method outlined above is the VBA function PK() included with Chap- 
ter2.xls. The method is essentially that of Bressel (1971), who goes on to take the 
additional step of averaging over a rectangular target. The PKQ} function uses 
double-precision arithmetic because the series being summed alternates in sign. 
See Grubbs (1968) for alternatives with better numerical stability. 

Since Pk can be evaluated analytically as a function of the aim point vectors x 
and y, we can now consider the problem of finding the pattern that will maximize 
Pk- This is a nonlinear optimization problem. On sheet “DGPattn” of Chap- 
ter2.xls, Excel’s Solver is employed to accomplish this. 



2.3.4 Area Targets and Multiple Error Sources 

Section 2.3.2 is often applicable even when there are multiple sources of error. 

Suppose, for example, that 

(a) the location of a target relative to some known datum is i?i. 

(b) all shots are to be fired from a platform whose location relative to the 
same datum is subject to an error Ej. 

(c) each shot has an individual firing error due to trembling on the part of 
the marksman. 

(d) an additional firing error is introduced due to an unknown wind velocity 
E,. 

(e) El, E 2 , E 3 , and E 4 are all independent, normal random variables with 0 
mean and variances af,i=l,2,3, 4. 
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It is necessary to classify each of the four errors as either “bias” or “dispersion.” 
E\ and E 2 are clearly bias errors, since we assume that the positions of the target 
and the firing platform are the same for each shot. Et, is clearly a dispersion error, 
since each shot has an independent dispersion that is different from all the rest. E 4 
might be a bias error if the unpredictable part of wind velocity were constant in 
space over the length of time required to fire the shots (the predictable part is ir- 
relevant, since the marksman could allow for it in aiming) or it might be a disper- 
sion error if the wind were very gusty. Assume the latter. Then, making the natural 
assumption that the four error types are independent of each other, and noting that 
it is only the total bias error and the total dispersion error that affect the fate of the 
target, the equivalent bias and dispersion error variances are of + o\ and 

o\ +o\, respectively, and Section 2.3.2 can be applied to the equivalent errors. 
The principle being used is the theorem that the variance of a sum of independent 
random variables is the sum of the variances. 

All of the above applies to targets whose only property is a location; i.e., to 
point targets. However, it is not difficult to handle area targets within this scheme, 
provided we are interested only in the average fraction of the target killed. In 
(2.18), if /(m,v) is interpreted to be the amount of target value per unit area at 
point (m,v), then Pjf has the meaning “total amount of value killed, on the aver- 
age.” Furthermore, if we normalize /(m,v) so that the total target value is unity, 
then Pk means “average fraction of the target killed,” or, equivalently, the prob- 
ability of killing a test element of the target, where the location of a test element is 
governed by the density function f{u,v) . In other words, the test element location 
{U,V) can be regarded as just another bias error, to be combined with any other 
bias errors as necessary. Any area target can be handled by converting the value 
density of the area target to an equivalent density function of a bias error, and then 
proceeding as if the target were a point target. This is especially easy to do, of 
course, if f{u,v) happens to be bivariate normal. 

Example 5: Suppose that {U,V) is circular normal with standard deviation 80 ft; 
that is, assume that the target’s value is spread out in a bell-shaped manner. Also 
assume that Ei, E 2 , E 3 , and E 4 are all circular normal with standard deviations 10 , 
20, 30, and 40 ft, respectively. Assuming that the wind error is dispersion, the 

equivalent dispersion is oy = (7y = ^30^ -i-40^ = 50 ft, and the equivalent bias is 

au= Ov = VlO^ -1-20^ -1-80^ = 83 ft. One could now proceed as in Section 2.2, 
probably by ignoring the dispersion error and using the SOLR formula to estimate 
Pk, which is now interpreted as the maximum possible expected fraction of the 
target killed by an optimal pattern. If the pattern consists of 20 shots with lethal 
radius 10 ft each, we find that z = 20(10)^7(2(83)^) = 1/69. The resulting Pk is very 
small, even according to the optimistic (2.19). Twenty shots of this size are simply 
not capable of doing much damage to a target as spread out as the one postulated. 
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The fact that area targets introduce an effective bias error is important in de- 
termining whether CMP or EMT is a better measure of effectiveness for an arsenal 
of weapons (see Section 2.3.1). Since (2.14) was derived under the assumption 
that the only firing error was dispersion, we can say that CMP is the proper meas- 
ure if the targets are point targets and if the bias errors are negligible. If the effec- 
tive bias (including the effects of target size) dominates the effective dispersion, 
however, then EMT is more appropriate. Thus (to conclude the comparison that 
was begun in Section 2.3.1), the United States nuclear arsenal in 1978 was more 
effective against well located, hard targets such as ICBM silos, but the Soviet Union 
arsenal was more effective against cities, which are well-located area targets, or 
against submarines, which are poorly located point targets. Dispersion is almost irrele- 
vant for either of these latter target types, even though it is crucial for the former. 



2.4 Multiple Shots, Multiple Targets, One Salvo 



In this section we consider situations where there are multiple targets, all of which 
must be attacked “simultaneously” in the sense that no feedback about results is 
available between shots (the next chapter considers feedback). If several shots are 
available, the question of how they ought to be distributed over the targets arises. 



2.4.1 Identical Shots, Identical Targets, Optimal Shooting 

If all shots in a salvo are identical and all targets are identical, then any reasonable 
measure of effectiveness will be maximized when the shots are spread as evenly 
as possible over the targets. If the number of shots is an integer multiple of the 
number of targets, then the number of surviving targets is a binomial random vari- 
able, since all targets have the same survival probability. The situation is some- 
what more complicated when the number of shots is not an integer multiple of the 
number of targets, since some targets are shot at one more time than others. Sup- 
pose there are b shots and n targets, with each shot having a miss probability q 
against its target. Let k be the integer part of bln, so that some targets are shot at k 
times while others (r of them) are shot aik+\ times; r is the number of shots left 
over after every target is shot at k times, so r = b — kn. The total number of surviv- 
ing targets is the sum of two binomial random variables, the number of survivors 
from the r targets shot aik+\ times and the number of survivors from the n—r tar- 
gets shot at k times. The distribution of the total number of survivors X is therefore 
the convolution of two binomial distributions. The probability mass function ofX 
is given by 
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(2.31) 



where Bin(y; t, p) is the binomial probability of y successes out of t trials when 
the success probability is p. The average number of survivors is 



The function SurvQ is included in module 1 of Chapter2.xls. 

Example 6: Suppose that « = 3 targets are attacked hy b = A weapons, each of 
which has an individual miss probability of ^ = 0.6. Then k = \ and r=\, so one 
target is attacked twice while the rest are attacked only once. The probability of 
three survivors is 0.1296, since all four shots must miss if all targets are to 
survive (the sum in (2.30) contains only the term for j= 1). The rest of the probabil- 
ity distribution of X can best be obtained using the SurvQ function defined in 
(2.30). From (2.31), the average number of survivors is 1.56. See sheet “Surv” of 
Chapter2.xls. 



2.4.2 Identical Shots, Diverse Targets, Optimal Shooting 

Flere we consider the case where the shots are all identical, but not the targets. 
This kind of situation is roughly the case in planning a nuclear first strike, so con- 
siderable work was done on the problem during the Cold War. 

When targets are diverse, there are many possible measures of effectiveness. 
We will assume that each target has a given value, and that the marksman’s goal is 
to kill as much value as possible, on the average. Determining target values is in 
practice problematic, but nonetheless necessary if an optimal attack is to be 
planned - only scalar measures can be optimized, and assigning values is the most 
direct way to put all targets on the same scale. Let v, be the value of target j, and 
let qj be the probability that one shot at target j will fail to kill it (the kill probabil- 
ity is 1-^^). 

Since all shots are assumed to be independent, the probability that Xj shots 
will fail to kill target j is q^^‘ . If there are n targets and b shots in total, this leads 

to the problem of minimizing the total average surviving value, subject to not us- 
ing more shots than are available, which we name problem PI : 



E{X) = rq'‘^' +{n-r)q\ 



(2.32) 



n 



Minimize , 



(2.33) 
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subject to '^Xj < b , and all variables Xj must be nonnegative integers. 

Problem PI has a nonlinear objective function, but nonetheless turns out to be 
an easy optimization. It can be solved by a “greedy” technique where shots are al- 
located sequentially, with each shot being allocated to the target for which the in- 
cremental target value killed is maximal. Imagine that the problem has already 
been solved, with x being the vector of optimal allocations, and that one more shot 
becomes available. If the shot is allocated to target j, the decrease in surviving 
value will be v .qp - Vjqp*^ =Vj.qp {I- qj) . The greedy technique starts out 

with X = 0, and then follows the principle “always assign the next shot to the target 
for which the decrease in surviving value is largest.” The procedure is efficient, 
and can easily find optimal allocations when targets and shots number in the thou- 
sands. 

Example 7: Suppose there are three targets, all with value 1, and q = (0.1, 0.5, 0.9). 
Using the greedy method, the first 4 shots should be at targets 1,2, 2, and 2, with 
the total surviving value after 4 shots being 0.1-1-0.125-1-0.9 = 1.125 . Note that 
most shots go to the target that is neither too easy nor too hard to kill. Target 1 is 
so easy to kill that a single shot will suffice, and target 3 is so hard to kill that 
shots against it are nearly wasted. The fifth shot, however, should be on either tar- 
get 1 or target 3, since either shot will reduce the surviving value by 0.09, which 
exceeds the 0.0625 reduction available from target 2. 



2.4.3 Diverse Shots, Diverse Targets, Optimal Shooting 

In the 1991 Gulf War, the allies were confronted with the problem of attacking 
Iraq’s air defense system in a single, coordinated attack. An air defense system 
consists of multiple targets, and weapons of various kinds (aircraft, helicopters, 
cruise missiles) were available to attack it. The planners of that attack had to de- 
cide which weapons should attack which targets, and to do so without the prospect 
of getting much information feedback, since it was imperative that the attack hap- 
pen quickly. This section deals with such situations, but one important aspect of 
that attack will be missing in this chapter. The actual attack anticipated losses to 
the attackers, with the losses depending very much on who attacked what. Such 
two-sided models will not be considered until Chapter 4. 

Since the shots are diverse, the miss probability now depends on both the tar- 
get and the weapon. Let qy be the probability that a weapon of type i will miss tar- 
get j if assigned to attack it. All attacks are assumed to be independent, so power- 
ing up still applies. If Xy weapons of type i are assigned to attack target j, the miss 
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probability is therefore The object is now to solve problem P2 defined 

below: 



Minimize Zv,r[9,./% (2.34) 

./ J 

subject to < b; for all weapons i, and all variables nonnegative integers. 

j=i 

The greedy method of always making the next assignment to reduce the total 
surviving value as much as possible does not work on problem P2, as shown in the 
following example: 

Example 8: Suppose there are 2 targets, each with value 1, and 2 weapons. 

Weapon A has miss probabilities of 0 and 0.1 on the 2 targets, while weapon B 
has miss probabilities of 0.1 and 0.5. The greedy method would first assign A 
to target 1, since A will certainly kill it, and would then assign B to target 2. 
The total surviving value would be 0 + 0.5 = 0.5. The optimal allocation is just the 
reverse - if B is assigned to 1 and A is assigned to 2, the surviving value is only 
0.1 +0.1 = 0.2. Weapon A is more effective than weapon B against all targets, and 
the greedy mistake is to lose sight of all other facts in assigning weapon A. A real- 
istic version of this example might have craise missiles playing the role of weapon 
A. Effective but expensive weapons such as cruise missiles should be used on 
high-value targets that are hard for other weapons to kill, so one needs to have a 
global perspective in assigning them. 

Problem P2 is sometimes referred to as the weapon target assignment prob- 
lem. It is known to be of a fundamentally difficult type, so no simple method like 
the greedy method will be able to solve it. Optimal methods can be expected to be 
lengthy on large problems, but a variety of efficient approximations and bounds 
have been investigated (Ahuja et al., 2003; Washburn, 1995a). 



2.4.4 Identical Shots, Identical Targets, Random Shooting 

Combat sometimes happens fast enough or communications are sufficiently diffi- 
cult that deliberate attempts to jointly optimize fires are impossible. Here we in- 
vestigate the consequences if several identical shots are randomly aimed at several 
identical targets, rather than optimally aimed as in Section 2.4.1. The assumption 
of randomness reflects the idea that coordination is impossible, so that a marks- 
man is as likely to shoot at one target as another, independent of the other marks- 
men. 

Thomas (1956), in the context of a situation where several bombers are sud- 
denly ambushed by a group of interceptors, derives the probability distribution of 
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the number of surviving bombers. Using the notation of Section 2.4.1, let 
RSurv(x;b,n,q) be the probability that x targets survive when b shots with individ- 
ual miss probability q are randomly distributed over n targets. Thomas shows that 
the probability mass function ofX, the number of survivors, is given by 



RSurv(x;b,n,q) 




(l-g)(x + 7) 

V i Jv n 



;x = 0, 



n 



(2.35) 



The function RSurv() is also provided in Chapter2.xls. See Exercise 13. 

The expected value of X is comparatively easy to derive. Consider any par- 
ticular target. That target will be killed by any particular shot with probability 
(1 — q)ln, since the target is chosen by the shot with probability Hn. Since there 
are b shots, each of which has an independent chance of killing the target, we can 
power up to find the target’s survival probability, which is (1 -(1 -q)lnf. Now, let 

4 indicate whether target x is killed (/^ = 1) or not (/^ = 0), so that X= ly H 1- /„ . 

We know that E{1^ = (1 - (1 -^)/«)* for every x. Since expected values and sums 
commute, even when the summed random variables are dependent, as they are 
here, the expected number of survivors is 

£(X) = n(l-^)*. (2.36) 

n 



A comparison between random and uniform allocation of shots is of interest 
for the case where all shots and targets are otherwise identical. We expect that 
(2.32) will show a larger average number of survivors than (2.36), since surely 
there is no advantage to the marksman for aiming randomly. This turns out to be 
tme. When ^ = 0.6 and n = 100 (or any large number), the average survival prob- 
ability E{X)ln is shown in Figure 5 for varying numbers of shots per target (k). The 
curves differ, but by surprisingly little, especially when k is either small or large. 
See sheet “OptRand” of Chapter2.xls to experiment with changes to n or q. 
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Shots per target (k) 



-Optimal 
■ Random 



Figure 5: E(^/n for optimal and random shooting, q = 0.6. 



2.5 Further Reading 



A great deal of work on aiming and kill probability computation was inspired by 
World War II. Morse and Kimball (1950, Chapter 5) record some methods used in 
the United States during that war. Wartime Soviet work on artificial dispersion 
and other topics can be found in Kolmogorov (1948). Eckler and Burr (1972) is a 
good summary of subsequent work up to 1972, or see Przemieniecki (2000), 
which includes some basic computer programs for computing kill probability. 

Although this chapter emphasizes firing problems in two dimensions, there 
are also one-dimensional problems of interest. Morse and Kimball (1950, Chapter 5) 
consider the problem of bombing a railroad track, and other linear structures such 
as roads or power lines may also prompt analysis. One advantage of one- 
dimensional problems is that the two-dimensional difficulty of packing the plane 
with circles disappears. With cookie-cutter weapons, in the absence of significant 
dispersion errors, the best pahem in one dimension is typically a “stick” that cov- 
ers an interval with no gaps. Morse and Kimball observe this, and David and 
Kress (2005) generalize to situations where the bombs are heterogeneous and 
asymmetric in their effects. 

The first few sections of this chapter treat firing errors as being one of two 
types. The first type is where all firing errors are mutually independent, and the 
second type adds the possibility that there may be a common component. Neither 
of these assumptions fit well for rapid-fire weapons, where the sequence of errors 
is better viewed as a stochastic process. Work on applications of such processes 
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was inspired by aerial combat in World War II in Great Britain, and subsequently 
reported in Cunningham and Hynd (1946). See also Fraser (1951, 1953). 

There is much in common analytically between the problem of firing at a set 
of diverse targets and the problem of trying to detect a stationary target that is lost 
among a set of cells. Just as shots are allocated to targets with the object of killing 
them, units of search effort are allocated to cells with the object of detecting the 
lost target. Problem PI in Section 2.4.1 serves both points of view and has bene- 
fited analytically from analysts in both camps. Perusal of the part of search theory 
literature devoted to stationary targets may prove fruitful to someone interested in 
firing theory. See Stone (1975), for example. 

All military services include killing targets among their duties, and many 
weapons are shared among the services. It therefore makes sense to have a joint 
organization responsible for determining how to calculate kill probabilities. In the 
United States, this is the Joint Technical Coordinating Group for Munitions Effec- 
tiveness, or JCTG/ME. The principal products of the JTCG/ME are Joint Muni- 
tions Effectiveness Manuals (JMEMs), some of which are in the form of standard- 
ized computer programs. JMEMs employ some of the methods covered earlier in 
this chapter, but also other methods that are more computer intensive and include 
more detail. These manuals (there are hundreds of them) also include classified 
data for speciftc weapons and targets. At the unclassified level, a good way to find 
out more about JCTG/ME or JMEM is to go to the Federation of American Scien- 
tists (FAO, 2008) and search on either of those acronyms. 




42 



2 Shooting Without Feedback 



Exercises 

( 1 ) Suppose D{r) = l- rifr<l;Oifr>l. What is the lethal area? 

Ans. nJl) 

(2) Plot Z)(r) for the target illustrated below, assuming that the weapon must hit 
the shaded area and that the impaet point is {r, 9) with 0 uniformly random 
in [0, 2;r]. Show that the lethal area is equal to the area of the target. 

Ans. D{r) is a step funetion, a = 2.5k 




(3) Show that (2.4) produees kE{R^) for lethal area, where E{F?) is eomputed us- 
ing the density funetion given by (2.6). Hint: use integration by parts. 

(4) Derive (2.7). 

(5) When aiming errors are basieally angular, the miss distanees should inerease with 
range. Suppose several independent shots are taken at a target, with = 0.1r„ 
where r, is the * range, and that the eookie-eutter lethal radius is 1. If the sueees- 
sive ranges are 10, 11, 12, . . ., inereasing by unity with eaeh shot, eompute Pk for 
the first shot, the first five shots as a group, and the first 10 shots as a group. 

Ans. {Pk (1) = 0.39, Pk (5) = 0.84, Pk (10) = 0.93). 

(6) A weapon with a radially symmetrie DG damage funetion is aimed at a ter- 
rorist eamp that is loeated 100 m from a baby-milk faetory. The weapon has 
different lethalities for eamp and the faetory. The parameter h (the eom- 
mon value of bx and bj) is 50 m for the eamp and 80 m for the faetory. The 
eireular normal firing error has standard deviation 50 m. What are the 
probabilities of killing the eamp and the faetory? 

Ans. (0.50 for the eamp, 0.41 for the faetory) 

(7) An aireraft attempts to kill a tank as follows: It first drops a eanister of 
“stiekers” in the hope that one will hit the tank and aetivate. If a stieker aeti- 
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vates, it can guide a projectile to the tank. The canister opens and scatters 
1000 stickers, with the amount of scatter being under the control of the de- 
signer. The exposed area of the tank is 100 m^. The aircraft makes a 
two-dimensional error with standard deviations (100 m, 300 m) in dropping 
the canister. What is the probability that a sticker hits the tank, assuming a 
well-designed canister? If the tank is longer than it is wide, does the direc- 
tion of the aircraft’s approach matter? 

Ans. (0.275, no) 

(8) Suppose you are given 16 detection devices, each of which is guaranteed to 
detect a target if and only if the relative distance is either less than 4 miles or 
between 30 and 33 miles (the “convergence zone” phenomenon in the ocean 
might be one explanation for such an assumption). The devices can be 
placed in any pattern whatever, and the object is to detect a target whose lo- 
cation relative to some known point is circular normal with standard devia- 
tion 30 miles in each direction. There are no dispersion errors. 

(a) Estimate px- 

(b) Make up a pattern and test it by writing a 5000-replication computer 
simulation. 

Ans. The lethal area is + 33^ - 30^) = 205;r square miles, so 
z = (16)(205)/1800 = 1.82. Given that the shape of the lethal area makes con- 
siderable overlap inevitable even in the absence of dispersion, a confetti ap- 
proximation is natural. The SOLR formula produces Pk ~ 0.57. This exam- 
ple has been the result of considerable experimentation, with the best pattern 
as of this writing having a detection probability of 0.64. The exact answer is 
unknown. 

(9) Suppose 10 cookie-cutter shots are available, with the lethal radius being 
30 ft for each. Estimate Pk for the area target and errors considered in Sec- 
tion 2.3, assuming that 

(a) the wind error is dispersion 

(b) the wind error is bias 

Ans. Using the SOLR formula in both cases, the expected fraction of the tar- 
get killed with an optimal pattern would be approximately 0.316 in case a), 
or 0.275 in case (b). 

(10) Use sheet “DGPattn” of Chapter2.xls to verify the two DG claims made in 
Example 4. Hint: To make the weapons have the right lethal area, you must 
set the DG lethality parameter b so that 2nb^= 7r(7.5)^. 

(11) Consider problem PI of Section 2.4.2, with n = 3, v = (1,2,3), and 
q = (0.3, 0.5, 0.8) . Allocate the first four shots using the greedy algorithm. 
Ans. The shots should be in the order 2, 1,3, 2, and the average value sur- 
viving should be 3.2. 
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(12) Write a computer program to solve problem PI of Section 2.4.2, using the 
greedy algorithm. Use it to solve Example 7. 

(13) Insert a new worksheet into Chapter2.xls and use it to analyze a situation 
where there are n = 3 targets and b = A shots, each of which has a miss prob- 
ability ofq = 0.6. 

(a) Use the Surv{x',b,n,q) function to compute the probability distribution 
of the number of survivors when b shots with miss probability q are distrib- 
uted evenly over n targets. Use that distribution to compute the mean number 
of survivors, and verify that (2.31) gives the same result. 

(b) Use the RSurv(x;b,n,q) function to compute the probability distribu- 
tion of the number of survivors when b shots with miss probability q are dis- 
tributed randomly over n targets. Use that distribution to compute the mean 
number of survivors and verify that (2.35) gives the same result. 

Ans. A partial answer is that the mean number of survivors should be 1.56 
and 1.69 in the even and random cases, respectively. See sheet “Surv”of 
Chapter2.xls for a solution. 



(14) If formulas 2.7 and 2.8 are both tme, show that CEP must be related to a as 
claimed in Section 2.2.2. 



(15) Consider problem P2 of Section 2.4.3, with v = (3,1) and Q 



0.5 


0.0 


0.6 


0.5 


0.7 


0.5 



There is a single weapon of each of the three types, and you are to assign 
weapons to maximize the total value killed, or equivalently to minimize the 
total surviving value. The best that can be done with one weapon is to assign 
weapon 1 to target 1, since this will kill 3(0.5) units of target value. Com- 
plete the application of the greedy algorithm to this problem, find the opti- 
mal assignment, and compare the two. 

Ans. The greedy algorithm results in weapons 1 and 3 assigned to target 1, 
with a total surviving value of 3(0.5)(0.7) -I- 1(0.5) = 1.55. The optimal as- 
signment has weapons 2 and 3 assigned to target 1, with a total surviving 
value of 3(0.6)(0.7) 1(0) = 1.26. 



(16) Page “Shotgun” of Chapter2.xls is a Monte Carlo simulation of a shotgun 
with 50 pellets trying to hit a target in the face of bias error. Relative to the 
bias, each pellet hits independently with a uniformly distributed angle and a 
dispersion distance X whose cumulative distribution function is 
F(x) = l-exp(-(x/b)^) . This is a Weibull distribution, which is often real- 
istic and always easy to simulate because it is possible to solve the equation 
for X. Now suppose that the only control you have over the pattern is through 
the dispersion parameter b. This control might be exercised through a choke 
on a shotgun or through the dispersement altitude if the pellets are actually 
bomblets. Experiment with the sheet to find the best value of b when the bias 
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standard deviation is 50 and the lethal radius is 1 0 and compare the resulting 
kill probability with the SOLR formula. Use SimSheet.xls to replicate the ex- 
periment a large number of times. 

Ans. With b set to 50, 60, 80, and 100, the kill probability after 3000 replica- 
tions is approximately 0.376, 0.416, 0.393, and 0.328, respectively. The best 
setting for b is about 60, and the SOLR formula comes close to predicting 
the optimized kill probability. 

(17) The DG damage function (2.9) can be reduced to one dimension (the x-dimension, 
say) by setting |j.y= (7y= 0, so Section 2.3.3 and its accompanying sheet 
“DGPattn” of Chapter2.xls also apply to finding multiple-shot patterns in 
one dimension. Use that sheet to find the optimal placement of four identi- 
cal bombs with unit reliability, bx= 30 and <5x= 30. Assume that the com- 
mon error has standard deviations of (au, Cty) = (100,0); that is, there is no 
bias error in the vertical dimension. In using that sheet, the value of by is 
irrelevant as long as it is not 0. 

Ans. The optimized pattern is (-100, -30, 30, 100), with a kill probability of 
0.68. You will not discover that fact if you start at (0, 0, 0, 0), since all de- 
rivatives are 0 there and Solver will therefore not move the aim points, so 
use some other starting point. If you increase the dispersion to <5x= 100 and 
re-optimize, all four bombs should be aimed at the origin. This is typical 
when the dispersion error is large. 




Chapter 3 

Shooting with Feedback 



I didn ’t know the gun was loaded. 
And I’m so, so sorry, my friend. 

I didn ’t know the gun was loaded. 
And I’ll never, never do it again. 

(lyrics by Hank Fort and Herb Leighton, 1949) 



3.1 Introduction 



This chapter deals with shooting situations where there is information feedback 
between shots and salvos of shots. The effect of this information is to decrease the 
number of shots required, to increase the kill probability, or even (as in the intro- 
ductory quote) to discover something about the weapon doing the shooting. In 
spite of the implied extended time period, we assume as in Chapter 2 that shooting 
is one sided, with no return fire. Although problems with multiple shots are also 
considered in Chapter 2, the addition of information feedback is an essential 
change to the nature of the problem, as will be seen. 

The information feedback can take various forms. One is a report about the ef- 
fect of previous shots on the target. For example, the same radar that keeps track of 
the target’s location might also be able to detect when the target breaks up into 
smaller pieces, thus indicating target destmction. The target’s behavior might also 
change after it is hit. Except for Section 3.3, all problems considered in this chapter 
are of this sort, with the feedback generally being whether the target is alive or dead. 

Another kind of feedback provides information about miss distances, rather 
than the live/dead status of the target. Artillery registration (“zeroing in”) is one 
example of this, and another is the Close In Weapon System (CIWS), a high rate 
of fire gun designed to protect ships from air attack. The CIWS radar measures the 
location of the stream of bullets, as well as the target’s location, and so can pro- 
vide information about the relative locations of shots and target. This information 
can be the basis of a procedure for adjusting the aim point, thus gradually elimi- 
nating bias errors. We consider such problems in Section 3.3. 

Some important tactical questions arising during the shooting sequence are 

• When do we stop shooting? 

• Where do we aim? 

• At which target do we aim? 

The answers will depend on tactical constraints, as well as feedback information 
available. There are situations (CIWS, for example) where both target location and 
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status are fed baek, and target status can be more complicated than a simple 
live/dead report. Feedback can be subject to error. Each unique situation deserves 
its own special analysis, so there is no hope of being comprehensive. The sections 
below describe a few basic situations, but there are many others. 



3.2 Feedback on the Status of a Single Target 



The shoot- look- shoot (SLS) procedure involves feedback about whether the target 
has been killed, which in this section is assumed to be perfectly accurate. The gen- 
eral idea is that one shoots, looks to see whether the target has been killed, and 
shoots again if necessary, possibly repeating this procedure several times. The ad- 
vantage of such information is that it helps to prevent the assignment of additional 
shots to a target that is already dead. 

The reduction in shots required when using SLS can be significant. For ex- 
ample, suppose that an important communications station is to be attacked by an 
expensive missile prior to an attack by land forces. The kill probability is 0.9 per 
shot, but the station is so important that = 0.99 is required. By powering up, 
we see that this can be achieved without feedback by simply assigning two inde- 
pendent missiles to the station. If a look is possible between shots, however, then 
90% of the time a single shot will suffice, since the first shot will kill the target 
with that probability. The second shot is required only 10% of the time, so the av- 
erage number of shots required with SLS is 1 + (0.1)(1) = 1.1, instead of 2. The 
SLS procedure requires just over half of the resources that would be required 
without feedback and has exactly the same kill probability. 

In the extreme case where the number of looks is unbounded, the marksman 
can even adopt the strategy “fire until the target has been killed.” In that case the 
problem is not to compute Pk (which is 1 .0), but rather to investigate the random 
variable N = “number of shots required to kill the target.” If the shots are all inde- 
pendent with kill probability p, then is a geometric random variable (Appendix 
A) with expected value E{Pl^ = 1! p . More generally, if is the miss probability 
of the ith in a sequence of independent shots, then 

E{N) = 'YjP^N > n) = \ + q^+q^q^+q^q^qj^ +.... (3.1) 

n=0 

This general formula reduces to a geometric series with sum l/p in the special case 
where q^ = \- p for all i. The first equality in (3.1), although not the usual defini- 
tion of expected value, can be shown to be valid for any nonnegative integer ran- 
dom variable such as N. 

A similar problem is one where the marksman has n shots available, but they 
all have different costs c„ as well as different kill probabilities p^ = 1 - q^ , for 
i = !,...,« . It is not unusual to have considerable diversity in both cost and effec- 
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tiveness between shot types, especially when the marksman has both guided and 
unguided weapons available. The target is assumed to be so important that the 
marksman will shoot at the target until either it is killed or the marksman runs out 
of shots. The overall miss probability is the product q^q 2 ,...,q„ , regardless of the 

order in which the shots are taken. However, the average cost of shooting depends 
on that order. 

Example 1: Suppose there are two shots, a missile and a bomb, with c = (2,1) and 
p = (1, 0.6). If the missile is used first, then the cost will be 2, since its cost is 2 
and it will certainly kill the target. If the bomb is used first, then the cost will be 1 
with probability 0.6, or 1 + 2 if the bomb misses, for a total average cost of 
0. 6(1) + 0. 4(3) = 1.8 . Since 1.8 is smaller than 2, as long as the time required to 
kill the target is not an issue, it is better to use the bomb first. 

It turns out that the optimal tactic is always to rank the shots in decreasing 
“bang-per-buck” order and then make the shots in that order. In Example 1, 
the ratio scores for missile and bomb are 0.5 and 0.6, respectively, so the bomb 
ranks higher and should be chosen first. See Glazebrook and Washburn (2004, 
Theorem 4) for a general proof of this statement. If large inventories of each shot 
type were present, then the optimal tactic would be to choose the highest ranking 
shot type and use it repeatedly until either it is exhausted or the target is killed. 
The average cost of killing the target with repeated use of shot type i is cjpi, the 
product of the cost per shot and the expected number of shots. That ratio is also 
the reciprocal of the bang-per-buck index, so this conclusion should not be sur- 
prising. 

Example 2: Suppose there are three shot types, with c = (1,2,3) , p = (0.2, 0.5, 0.8) , 
and with two shots of the second type and one of the other two. Since type 3 ranks 
higher than 2, and 2 ranks higher than 1, the shots should be taken in the order 3, 
2, 2, 1. The overall kill probability is l-(0.2)(0.5)^(0.8) = 0.96 . The average cost 

of shooting is 3x0. 83-5x0.2x0.5-1-7x0. 2x0. 5^ -1-8x0. 2x0.5^ = 3.65 . Taking 
the shots in any other order would result in the same kill probability and a higher 
average cost of shooting. Without applying SLS, the same overall kill probability 
would cost 8, since all four weapons would be required. 



3.3 Feedback on Miss Distances 



In this section, we assume that a forward observer provides feedback to the 
marksman on the location of each shot’s impact point, relative to the target, as in 
Figure 1. If the marksman is initially uncertain about the target location, the 
alignment of his own sights or any other source of bias error,this information can 
be used to “zero-in” on the target by adjusting the aim point. The essence of the 
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problem is to adjust without over-adjusting - the marksman must also eonsider 
that there are reasons other than bias for missing the target, notably the independ- 
ent dispersion errors that accompany the shots, for which adjustment of the aim 
point would be unwise. 



Figure 1: The miss distance X is observed and fed back to the cannon, which 
can adjnst its aim point for the next shot. 

It is easiest to begin with an analysis in one dimension. In the shoot-adjust- 
shoot (SAS) procedure, it is assumed that an observer provides the actual impact 
point of the ith shot, relative to the target. The observer reports are useful because 
they help the marksman to estimate whatever bias error B is present, and thereby 
to adjust the next aim point A^+x to take account of it. Assuming that the dispersion 
error is E; for the ith shot, the observer’s ith report is 



The aim point Ai is known and A, can be observed, but B and are unknown, 
except for their sum. The aim point Ai+i should be the negation of the marksman’s 
minimum variance estimate of the unknown bias B, based on the observed miss 
distances Xi,..., Aj. Since B+ Ei is an unbiased observation of B, and since 
B + E. = X. - A . , the next aim point is obtained by averaging: 




X^ — B + E^ + A^ \ i > ^ . 



(3.2) 




(3.3) 



Therefore, with all aim points but the first being given by (3.3), 
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Assuming that the dispersion errors are normal, independent, identically distrib- 
uted random variables with mean 0 and variance <T^ , (3.4) implies that E{Xi+\) = 0 
and, since the variance of a sum of independent random variables is the sum of the 
variances (see Appendix A), 

Var(X.^^) = a^ +... + a^) = + (3.5) 

Equation (3.5) gives the variance of the miss distance for every shot except 
the first, which we regard as a “calibration shof ’ that is incapable of killing the 
target (making Var{B) infinite would have the same effect). Since the miss dis- 
tances after the first can be shown to be independent of each other, the probability 
of kill with a fixed number of shots can be obtained with the same independence 
argument that leads to (2.13). 

Equations (3.3) and (3.4) together imply that 

4^1-4 =-A./;;i >0. (3.6) 

Equation (3.6) states that the aim point for the next shot should be corrected by a 
decreasingly small fraction of the previous miss distance. Note that early miss dis- 
tances are taken more seriously than late ones; the marksman is not inclined to 
make large aim point adjustments late in the firing procedure because the bias has 
almost been eliminated. Barr (1974) makes the point that the normality assump- 
tions made here are not really necessary for the optimality of this aim point correc- 
tion rule. 

It would be simpler to adopt an aim point correction rale, something like a 
procedure that we will call 3CAL: aim the first three shots at the target, thinking 
of them as calibration shots, and then let the aim point for all following shots be 
-(Aj +X^+ X^)l2i , the negation of the average of the reported miss distances. 
Does the simplicity of 3CAL compensate for its smaller effectiveness? Clearly the 
answer is no if there are only three shots, since the three calibration shots will 
never hit the target if the bias is large. The answer can therefore be expected to 
depend on the number of shots. Since the two competing procedures are well de- 
fined, answering this question would be a good application for Monte Carlo simu- 
lation. Sheet “3CAL” of Chapter3&4jcls is such a simulation (see Exercise 8). 

Suppose now that the SAS procedure is carried out independently in each of 
two dimensions, using n cookie-cutter shots with lethal radius R, including the 
calibration round. The two-dimensional miss distances will then be circular nor- 
mal with variance given by (3.5), and therefore, using (2.7) and (2.13) in the same 
manner as in Chapter 2, 






Pf, = I- exp 



(3.7) 
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Note that the effectiveness (CMP - see Section 2.3.1) of the ith shot, com- 
pared to its effectiveness in a problem with no bias error, is {i - 1)1 i. The SAS pro- 
cedure is evidently not completely effective in getting rid of the effects of the bias 
error, except in the limit when there are many shots. There is nonetheless a rea- 
sonable sense in which it is the optimal aim adjustment procedure (Nadler and 
Eilbott, 1971; Barr, 1974). 



3.4 Shoot Look Shoot with Multiple Targets 



When multiple targets are present, the range of alternative actions expands consid- 
erably. There are many possibilities for the nature of the feedback, objective func- 
tion, and tactical constraints. Here we examine only feedback about target status. 
As in the case of a single target, the significance of this feedback is that it permits 
one to avoid shooting at targets that are already dead. 

Our goal will be to kill as many targets as possible, or more generally as 
much target value as possible, on the average. It would of course be equivalent to 
minimize the value of the surviving targets, and it will sometimes be convenient to 
do that. This is a general-purpose goal - a more specific goal will be considered in 
Chapter 4. The general-purpose goal is appropriate when the benefit from killing 
targets is proportional to the number killed. An example of a situation where this 
is not trae would be an attack on a redundant communications system consisting 
of three repeaters. Killing two of the repeaters is useless, since only one is re- 
quired, so the value of doing so is not 2/3 of the value of killing all three. 



3.4.1 Identical Shots and Identical Targets with a Time Constraint 

We assume that the shots all have independent effects, with single shot kill prob- 
ability p = l-q, and that the time constraint takes the form of a limitation on the 

number of salvos that are possible. The number of salvos might be limited because 
of the time to reload a launcher or because of the time required to assess target 
status - see Section 4.2.1 for a geometric analysis of one self-defense situation. 
Were it not for this constraint on the number of salvos, the optimal firing policy 
would be “shoot once at every live target, assess the status of each target, and then 
repeat until there are either no weapons left or no targets left.” The only difficulty 
with that policy is that there might not be enough time to carry it out. 

Consider an air defense system that has nine missiles available and has de- 
tected a flight of three enemy aircraft. Considering the speed of the aircraft and the 
time required to carry out each engagement, including status assessment, assume 
there is time for only two salvos. The question is, how many missiles should be 
fired against each of the aircraft in the first salvo? Launching three missiles 
against each aircraft would leave no missiles available to deal with any survivors 
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of the first salvo and might waste several missiles on targets killed by other mis- 
siles. Launching only one missile against each aircraft might leave several aircraft 
still alive with only one salvo remaining. Might it be optimal to launch two mis- 
siles against each aircraft, or, for the sake of argument, two missiles against one 
aircraft and three against each of the other two? Any of these answers might be 
correct, depending on circumstances. What we need is a reliable method for de- 
termining the best firing policy. 

Since all targets are identical, each salvo should be spread as evenly as possi- 
ble over the remaining targets. The firing question thus reduces to detemiining 
how many shots x to spend in each salvo. Let random variable The the number of 
targets surviving out of t when x shots are fired, and let 7^, (^, t) be the number of 
targets that can be expected to survive n additional salvos when s shots and t tar- 
gets remain. Statements such as the previous sentence are often the introduction to 
an analysis by dynamic programming, as is the case here. Since all targets will 
survive if there are 0 salvos remaining, we have {s, t) = t , and for n > 0 we have 
the iteration 

F„{s,t) = E{F„_,{s-x,Y)). (3.8) 

Equation (3.8) is at the heart of the dynamic programming procedure. It says 
that the average number of targets surviving is the same as the average number 
that survive when the number of salvos is reduced by 1, the number of shots is re- 
duced by X, and the number of targets surviving the current salvo is Y. The ex- 
pected value is needed because of the randomness of Y. Note the distinction be- 
tween Y, the number of survivors of the salvo taken when n salvos remain, and 
F^ {s, t) , the (minimized) expected number of survivors of all n remaining salvos. 

The decision variable x in (3.8) can be any integer between 0 and s and should be 
chosen to minimize the expected number of targets surviving all remaining salvos. 
The desired probability distribution of Y is Surv(y; x, t, q) as given by equation 
(2.31), since x shots should be spread as evenly as possible over t surviving tar- 
gets. 

The basic idea in solving (3.8) is to construct a sequence of tables indexed by 
n, starting at 0 and gradually increasing n until the number of salvos available is 
reached. Since F^(s,t) is known for all 5 and t, (3.8) pemits F^(s,t) to be com- 
puted for s and t ranging from 0 up to the maximum number of shots and targets. 
Once F^(s,t) is known, (3.8) permits the computation of F^(s,t) over the same 
range of s and t, since everything on the right-hand side is now known. With 
F^{s,t) known, F^{s,t) can be computed, etc. Finding the minimum is done by 
simply trying all feasible values for x. As computations proceed, the optimal value 
for X is recorded and written in a separate table, for later recall when needed. One 
feature of this procedure is that, if the number of salvos is limited to (say) 3, then the 
optimal policies for 1 and 2 salvos must be determined before considering the opti- 
mal policy when three salvos remain. This is actually a tactical advantage, since a 
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three-salvo problem will become a two-salvo problem right after the first salvo is 
fired. 

It may seem that the procedure described above is simply optimization by ex- 
haustion, since all possibilities for x are considered in the minimization step. This 
is not the case. The computations are more efficient than exhaustion because it is 
never necessary to consider firing policies that encompass multiple salvos, as 
might be thought necessary. We never have to consider complicated strategies 
such as “fire 5 in the first salvo, and then 3 in the second unless only one target 
remains, in which case . . . and then in the third salvo . . . .” Each computation sim- 
ply minimizes the number of survivors with one less salvo remaining, until the ac- 
tual number of salvos is finally considered. Equation (3.8) is an application of dy- 
namic programming (DP) to a Markov decision process (Puterman, 1994), a 
technique that is often useful in optimization problems that involve exhaustible re- 
sources and time. We will employ it repeatedly, generally referring to “stages,” 
rather than salvos. DP is characterized by a functional equation like (3.8) where 
the objective function (FO, in our case) appears, albeit with different arguments, 
on both sides of the equation. 

Correct identification of the state is a crucial part of DP problem formulation. 
It may help to imagine a “change of command” in the middle of the process, with 
the state of the process being whatever information the old commander should 
transfer to the new one. In the problem under consideration, the state is («, s, t), 
where the three variables are the number of stages left, the number of shots left, 
and the number of targets still surviving. The past may have much more detail 
than that, but all such detail is irrelevant for purposes of future action - three 
numbers suffice. Once the state is identified, the DP procedure is always to define 
an objective function that depends only on the state variables, and then write down 
an equation such as (3.8) that has the effect of advancing the stage index by unity. 

Sheet “DPShooter” of Chapter3&4.xls solves a firing problem with three 
salvos by this technique. The single-shot kill probability can depend on the salvo 
index, so there are three such inputs, rather than just one. It is suggested that the 
reader experiment with that sheet, especially if DP is a new subject. Do not forget 
to press the “DP Solve” button after changing the inputs, since results are not up- 
dated automatically. Sometimes the optimal results are obvious. For example, the 
dynamic program concludes that all remaining shots should be fired when there is 
only one salvo remaining, which should require no analysis whatever. Other re- 
sults are not obvious, especially when the kill probabilities differ among the 
stages. To see the code that accomplishes the optimization, press ALT-Fll to 
get the VBA editor, and then double-click sheet “DPShooter” in the project ex- 
plorer. The code first reads the numbers of survivors with no salvos left from the 
spreadsheet, and then optimizes shooting with 1, 2, and 3 salvos remaining, fol- 
lowing (3.8), and outputting those results to the spreadsheet. 

The exact computations described above can be approximated. The simplest 
of these approximations is to apply expected value analysis (EVA) as introduced 
in Chapter 1, combined with relaxing the integer requirement for the number of 
shots per target. Thus, ifv shots are spread evenly over t targets, the survival prob- 
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ability of each target is q’‘^‘ , so, using EVA, targets survive for the next 
salvo. If ti is the number of targets remaining after the ith salvo, then (3.8) can be 
replaced by ; i > 0 , with to being the given initial number of targets. If 

there are n salvos, then t„ should be minimized subject to the constraint that 

<s , where ^ is the number of shots available. Sheet “EVA” of Chap- 

1=1 

ter3&4.xls employs Excel’s Solver in this manner. This is a much simpler analysis 
than (3.8), but the results of the two methods agree only approximately. Exercise 
10 is related. 



3.4.2 Variety of Shots and Targets, No Time Constraint 

In this case shots can be made one at a time, thereby avoiding the possibility of 
shooting at a dead target. The complication arises because it is not clear which 
weapon to assign to which target (in this section we refer to “weapons”, rather 
than “shots”, to prevent confusing statements such as “we first shoot shot 2”). 
Since the targets are all different, we adopt the goal of killing as much target value 
as possible, with target j having value v,. Let py be the known kill probability if 
weapon i is assigned to target j, possibly computed using one of the methods of 
Chapter 2. A recursive relationship such as equation (3.8) is still possible, but we 
can no longer merely keep track of the numbers of remaining weapons and targets, 
but must instead keep track of the sets of those. Call those sets S and T, and let 
V {S, T) be the maximum amount of target value that can be killed, on the aver- 
age, if the set of S weapons is optimally applied to the set of T targets. If either S 
or T is empty, then of course F(5, T) = 0. 

V (5, T) can be found by a dynamic programming recursion. Since each shot 
either kills its target or not, and since all shots are independent, we have 

V{S, T) = max,,,., {;,, (v, + V{S -i,T- j)) + (1 - Py)V{S - i, T)}. (3.9) 

If the first shot by weapon i is aimed at and kills target j, then the total value 
killed, on the average, is Vj + V{S-i,T - j) . If the first shot misses, then the total 

average value killed is V(S-i,T) . Averaging the two possibilities, we have (3.9). 
To use (3.9), we first evaluate V(S,T) for all target subsets when 5 has only one 
weapon in it. Next we can solve all problems where S has only two weapons in it, 
since S—i will have one weapon if S has two, and so on. The number of evaluations 
of (3.9) required to ultimately complete this task can be very large, especially if S 
and T are large sets. Bellman (1961) refers to this as “the curse of dimensionality.” 
Dynamic programming still works in principle, even in the presence of the curse. 
As a practical matter, however, it can only be used on small problems. 
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If Vj = V = p for all i and j, as in Section 3.4.1, then (3.9) simplifies 

considerably because the sets S and T merely need to be counted. Let 5 and t be the 
numbers of weapons and targets, and let X be a binomial random variable with 5 
trials and p success probability. X can be interpreted as the number of effective 
weapons. The number of kills will be X unless the marksman mns out of targets, 
so V {S ,T) = E{mm{X ,t)) , a relatively simple computation. Anderson (1989) 
notes that the same formula applies as long as Py does not depend on j. See also 
Przemieniecki (1990) and Exercise 7. The same results could be obtained from 
(3.9) with a very large value of n, of course, but with much more difficulty. The 
associated optimal firing policy is trivial: “shoot at any live target as long as you 
have any weapons left.” 

Since (3.9) is problematic for large problems, we next develop some bounds 
on V {S, T) in the general case. 

A simple lower bound V _(S,T) can be constructed by computing the opti- 
mal pair (;*,y*) = argmax.^^^.gj,v^/7,^. . This is the “myopic” firing policy that 
maximizes only the immediate gain - every shot is taken as if it were the last. 
Equation (3.9) (with substituted for (i,j) and V_() replacing V() on both 

sides) must still be employed to evaluate V _(S, T ) , so exact evaluation is still dif- 
ficult. Monte Carlo simulation would be a good way to approximate V _{S, T ) , 
but the myopic policy is trivial to implement in any case because knowledge of 
V _{S, T) is not needed to do so. 



Example 2: The myopic policy is not always optimal. Consider two weapons and 



two targets, withP 



1 

0.9 



0.9 

0 



and V = (1, 1) . P is a matrix of kill probabilities. 



with rows for weapons and columns for targets. The myopic policy will assign 
weapon 1 to target 1, since a gain of 1 is better than a gain of 0.9. Weapon 2 is 
then useless, since it has no capability on target 2. The optimal policy will first as- 
sign the second weapon to target 1, and then the first weapon to target 1 in case of 
failure, or to target 2 in case of success. The optimized total score is 0. 9(1+0. 9) + 
0. 1(0+1) = 1.81 - a substantial improvement over the myopic score of 1.0. 



An upper bound applicable in more general circumstances is derived in the 
next section. 



3.4.3 Variety of Shots and Targets, Unreliable Feedback 

So far, we have only considered problems where the information feedback con- 
sists of a reliable live/dead report for each target. There are many other possibili- 
ties. We might measure things such as the target’s radar cross section, its tempera- 
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ture or the kind of noise that it is making. Each of these is related to whether the 
target is live or dead, but not a precise statement about which is the case. Some of 
the relevant literature for problems of this sort will be mentioned in the next sec- 
tion, but the analyses tend to be beyond the scope of this book. In this section we 
consider only an upper bound on the effects of firing that is valid no matter what 
kind of information about target status is fed back, as long as the single shot kill 
probabilities against live targets remain constant. When information is imperfect, 
it is possible that shots will be made against targets that are already dead. As long 
as the target is still alive, however, the kill probability is assumed to remain py as 
long as the shot is of type i and the target is of type j. 

Fixed sets of weapons and targets are given. Define a collection of indicator 
random variables (random variables whose only possible values are 0 and 1) that 
can be associated with any firing policy: 

Yij = 1 if target j is killed by weapon i (for each j, at most one of these is 1), 

y, = 1 if target j is killed, and 

Xij= 1 if weapon i is assigned to target j (for each i, at most one of these is 1). 

The firing policy will induce many dependencies between these random variables, 
so independence assumptions among them are not appropriate, but the collection 
is still useful for formulating the problem of finding the optimal policy. We first 
note thaty^. = '^Y.j , and that the total value killed isZ = '^vYj . The problem 
' ./ 

(call it PI) of finding the optimal policy can therefore be posed as maximizing z, 
the expected value ofZ, subject to the following constraints: 

(a) < 1 for all i with certainty, 

j 

Yy for all j with certainty, 

(c) Y. < 1 for all j with certainty, and 

(d) other constraints. 

The other constraints in PI include the crucial relationship between Ay and Yy, 
the essence of the firing policy. For example, the (probably foolish) policy of ig- 
noring all information and simply making A, i=l for all i would probably kill target 
1 , but would also result in T, = 0 for y > 1 . There are potentially an astronomical 
number of firing policies, since the decision about what to do next can depend in 
many ways on the information available. Nonetheless, regardless of the policy 
employed, we assume that E{Yy) < p^yE{Xy) , with strict inequality being possi- 
ble because target j might aheady be dead when weapon i attacks it. Now, using 
lower case letters for expected values of random variables (so Vj =E(Yj) and 

Xij = E{X..) ), we can construct a relaxation of PI that we name P2: 
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maximize 2 = subject to 

/ 

(a) ^ Xy < 1 for all i, 

j 

(b) yj<Y,XyPij for ally, 

(c) ^ 1 for all j, and 

(d) all variables nonnegative. 

P2 is a relaxation of PI because a relationship that is true with certainty will 
also be true on the average, because sums and expected values can be inter- 
changed, because < Xy py by assumption, and because the other constraints 
(d) of PI have simply been omitted in P2, except for the part that requires non- 
negativity. Since P2 is a relaxation of PI, P2 provides an upper bound on what is 
achievable with any shoot-look-shoot policy. Since P2 is a linear program (Ap- 
pendix B), it can be efficiently solved. 

P2 has a direct interpretation where Xy is the probability of using weapon i on 
target j, and yj is the probability that target j is killed. In P2, (a) requires that 
weapon i not be used more than once, (b) requires that the effect of each weapon 
assigned to target j not exceed py, and (c) requires that target j be killed at most 
once. 

If hi weapons of type i are actually identical, then constraints (a) of P2 can be 
changed to ^ x,. <bi, a simple consequence of collecting terms with identical co- 

./ 

efficients in P2. In that case the interpretation of Xy is “average number of weap- 
ons of type i used on target y.” The upper bound calculations are trivial compared 
to the solution of (3.9), especially if there are large groups of identical weapons. 

The reader may wish to experiment with sheet “Optimal” of Chapter3&4.xls. 
The command buttons carry out three computations. One is the calculation of 
V(S,T) using (3.9). This calculation is potentially time consuming, so try a small 
problem first. The second is the myopic calculation of Section 3.4.2, and the third 
is the upper bound just derived. The upper bound computations are carried out us- 
ing data on sheet “LP Upper,” but that sheet need not be examined unless the 
reader is interested in the details of solving a linear program using Excel’s Solver. 
The reader can verify that the upper bound for the small example introduced in 
Section 3.4.2 is actually exact, and that the upper bound is usually close to the 
tmth for problems not designed to make it look bad. 

Example 3: This example is designed to make the upper bound look bad. Con- 
sider a problem with one target and two weapons, each of which will kill the target 
with probability 0.5. The only available allocation of weapons produces a kill 
probability of 0.75, while the solution of P2 is that z can be made 1 by setting 
x,j = 2 andyj = 1 . 



3.5 Further Reading 
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Example 4. Suppose there are four hard targets with value 50 each, four soft tar- 
gets with value 100 each, and five false targets with value 0. Each of these targets 
is either live or dead at all times, and some may be dead initially. The marksman 
has available five bombs and four missiles. A bomb will not kill a hard target, but 
will kill any other target with probability 0.5. A missile will kill a hard target with 
probability 1, or any other target with probability 0.5. Before each salvo, the 
marksman has an information system that examines the battlefield and possibly 
gives him a report about the existence and status of each target. Some targets may 
be omitted from the report, the report may be error prone, and reported informa- 
tion may be only indirectly relevant (a target’s temperature, for example). The 
marksman may be initially ignorant of the nature of the battlefield, including the 
information specified above. Determination of an optimal firing policy would re- 
quire much more detail about the marksman’s initial state of uncertainty and the 
nature of the battlefield reports, but there is enough information given above to 
compute an upper bound on the results of using that policy. All that is required for 
computing the upper bound are the sets of weapons and targets, the target values, 
and the kill probability matrix. Although the false targets might be a significant 
problem to an actual marksman, the upper bound calculations simply omit them 
(see Exercise 1 1). 

The upper bound can be easily computed even in circumstances where finding 
the optimal policy according to (3.9) or (3.8) is computationally impossible. Lin- 
ear program P2 is solvable in a few seconds even for thousands of weapons and 
targets. 



3.5 Further Reading 



The optimal use of information is a difficult subject in general, and firing prob- 
lems are no exception. This is a bit of an analytic crisis for military analysis, since 
modem military systems rely heavily on the timely acquisition and distribution of 
information. Information is not free, so there is sometimes a need to evaluate its 
effectiveness in quantitative terms, but evaluation is hampered by the analytic in- 
tractability of many of the associated problems. The models presented in this 
chapter, while hardly trivial, are only a small start on a large problem. 

Error-prone target status information is common in reality, but not discussed 
above except for the bound in Section 3.4.3. Such information leaves the marksman 
in doubt about the status of targets, so it would be natural to introduce a probabil- 
ity distribution for the state of the target or the states of each of several targets. 
One can apply Bayes theorem to update the target state distribution(s) after each 
report. In some cases a myopic or “greedy” strategy something like “always shoot 
at the target most likely to be alive” can be shown to be optimal (Manor and Kress 
(1997), Glazebrook and Washburn (2004, Section 1)). Aviv and Kress (1997) dis- 
cuss some simple firing procedures that are nearly optimal. If targets differ, but 
there is only one kind of weapon, the shooting process may be indexable in the 
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sense that one should always shoot at the target with the highest numerical index 
(Glazebrook and Washburn, 2004, Section 4). More generally, this line of thought 
ends up with a difficult partially observable Markov decision process where the 
state of the process is itself a probability distribution (Yost and Washburn (2000), 
Glazebrook and Washburn (2004, Section 2)). Since the mere application of Bayes 
theorem can itself be data-intensive, and since Bellman’s curse is operating, it is 
fair to say that techniques such as these are (in 2009) some ways from practical 
implementation. This situation might change as computers get more and more ca- 
pable. In the meantime, lacking theoretical guidance about optimal firing policies, 
military judgment about dealing with uncertainty is often built into combat simu- 
lations instead. 

A practical marksman can be in doubt about more than the target’s functional 
status. The very identity of the target can be in doubt, witness the occasional in- 
stance of military fratricide. False alarms can happen, particularly in antisubma- 
rine warfare where there are many phenomena that can lead to the appearance of 
phantom submarines. There are very few quantitative analyses of how to shoot op- 
timally in such circumstances. 
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Exercises 

(1) Suppose there are four shot types, with the cost and kill probability vectors 
being c = (1,2, 3, 4) and p = (0.2, 0.3, 0.4, 0.5). 

(a) If there is one shot of each type, in what order should the shots be taken, 
what is the resulting kill probability, and what is the average cost of shoot- 
ing? 

(b) Same as part (a), except that there are unlimited numbers of each type. 
Ans: In part (a), the shots should be taken in order 1,2, 3, 4, the kill prob- 
ability is 0.832, and the average cost of shooting is 5.624. In part (b), only 
shot type 1 should be used, the kill probability is 1, and the average cost of 
shooting is 5. 

(2) Suppose you are to make eight shots at a target, each of which has a circular 
normal dispersion error with standard deviation 2 and a cookie-cutter lethal 
radius R. Which of the following three situations will give the highest kill 
probability? 

(a) i? = 1 , and there is no bias error. 

(b) R = 2, there is a circular normal bias error with standard deviation 5, and 
an observer reports miss distances as in the SAS procedure. 

(c) R ^ 3, there is a circular normal bias error with standard deviation 5, and 
there are no observer reports 

Ans: The first two kill probabilities are l-exp(-l) and l-exp(-2.64) , so (b) 
is much better than (a). The answer for (b) uses (3.7) and has nothing to do 
with the number 5, since the SAS procedure eliminates the bias error. The 
exact Pk for (c) cannot be given without examining patterns of 10 shots as 
in Chapter 2, but an upper bound on the kill probability is l-exp(-0.64) , so 
it is surely the worst of the three situations. The best of the three situations is 
thus (b), and the kill probability is nearly 1 . 

(3) With the three miss probabilities all set to 0.5, sheet “DPShooter” of Chap- 
ter 3 &4.xls claims that the optimal number of shots to fire with 2 salvos, 12 
shots, and 5 targets remaining is 6. This is one of a small number of cases 
where the number of shots fired is not a multiple of the number of targets - 
one target is shot at twice, while the other four are shot at only once. The 
sheet also claims that shooting six shots, plus optimal firing in the last salvo, 
will result in 0.4993 targets surviving out of 5. Verify that the same advice is 
given with three salvos remaining if the kill probability for the final salvo is 
set to 0. Do not forget to press the “DP Solve” button, since spreadsheet out- 
put will not be updated unless you do. Setting the final kill probability to 0 
should have the same effect as having one less salvo, so this is an exercise in 
verification. 
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(4) In the same circumstances as Exercise 3, Marie must decide how many shots 
out of 12 to fire at 7 targets when there are 3 salvos left. The “DPShooter” 
sheet advises firing 7 shots, but, for one reason or another, she only fires 4. 
The result of this is that she kills 2 targets, so she has 8 shots and 5 targets 
remaining when there are 2 salvos left. If she fires optimally in the last 2 sal- 
vos, how many targets will she kill, on the average, in the last 2 salvos? 

Ans: 1.2227 will survive, so she will kill 3.7773, on the average. Given that 
there are 8 shots, 5 targets, and 2 salvos remaining, the results do not depend 
on how that state happened to arise. This is the llindamental fact on which 
the whole computational scheme is based. 

(5) In the same circumstances as Exercise 3, the “DPShooter” sheet claims that 
the optimal number of shots is 2 when there are 4 shots, 2 targets, and 3 sal- 
vos remaining. If this advice is followed, the number of survivors with two 
salvos remaining will be either 0, 1, or 2. Compute the probabilities of those 
events and show by averaging the appropriate stage 2 results that the average 
number of survivors after all salvos are complete is indeed 0.375, as the 
sheet claims. 

Ans: (0.25)(0)-b(0.5)(0.25) + (0.25)(l) = 0.375. 



( 6 ) 



Suppose there are three weapons and three targets of equal value, with the 



kill probability matrix being P 



0.1 


0.2 


0.3 


0.4 


0.5 


0.6 


0.7 


0.8 


0.9 



. Use sheet “Optimal” to 



show that the maximized average number of targets killed with SLS is 1 .683 
with perfect feedback after each shot (the sheet also should show an upper 
bound of 1.711). The firing policy for achieving 1.683 has been computed 
but not displayed, since displaying the method in a spreadsheet would be too 
complicated. Static policies that do not utilize information feedback, as con- 
sidered in Section 2.4.3, are much simpler to explain and evaluate. For ex- 
ample, the policy of firing weapon i at target i for all three targets would 
achieve an average number killed of 0.1 + 0.5 + 0.9 = 1.5 or the policy of fir- 
ing all three weapons at target three would achieve 
1 — (1 — 0.1)(1 — 0.6)(1 — 0.9) = 0.964. What is the best static policy that you 
can find? The average number of targets killed cannot exceed the SLS value 
of 1.683. Does it exceed the myopic value that is also given on sheet “Opti- 
mal?” 

Ans: There are many ways of achieving it, but the best value achievable with 
a static policy is 1.5. 



(7) Section 3.4.2 includes the formula V{S,T) = E{mm{X,t)) for the expected 
number of targets killed out of t when A is a binomial random variable with 5 
trials and p success probability. Write a computer program that will take the 
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three inputs {s,t,p) and compute the average number of targets killed. If you 
are using ExceF'^, make use of the BINOMDIST() function. 

Ans: If (s,t,p) = (15, 10,0.8), then the average number of targets killed 
should be 9.916. Even though the average number of killing shots 
(sXp = 12) is greater than 10, the average number of targets killed is 

slightly less than 10. 

(8) Sheet “3CAL” of Chapter3&4.xls is a Monte Carlo simulation where SAS 
and 3CAL are compared as to their ability to hit a target in one dimension. 
Study the structure of that sheet to verify that the comparison is a fair one. 

(a) Remarkably, both procedures seem to produce exactly the same results 

on the fourth shot, as can be seen by pressing F9 repeatedly. Explain 
analytically why the two aim points are always identical on the fourth 
shot. 

(b) Use “SimSheef’ of AppendixC.xls (see Appendix C) to confirm that SAS 

is superior to 3CAL on the tenth shot by showing that it has a higher 
hit probability. 

(9) The miss distances in the SAS procedure are defined by (3.4) as a function 
of the dispersion errors, and the claim is made in that section that they are all 
independent of each other. The truth of this is not entirely obvious, since 
each miss distance depends on the same sequence of dispersion errors. Prove 
that X 2 is in fact independent of X 4 . Hint: Since the miss distances are all 
normally distributed, they are independent if and only if they are uncorre- 
lated, so it suffices to show that E(X^X^) = 0. 

Ans: Using (3.4), the product X 2 X 4 can be written out term by term. Terms 
where the factors have different subscripts can be ignored, since they aver- 
age 0. Except for such terms, the product is (Ef -El)/3, which also aver- 
ages 0 because the dispersion errors are all identically distributed. 

(10) Sheet “EVA” of Chapter3&4.xls solves a problem where 12 shots are allo- 
cated to 7 targets in 3 salvos, just like sheet “DPShooter.” Set the miss prob- 
ability in each of the three salvos to 0.5 and compare the number of survi- 
vors in the two models. 

Ans: There are 1.295 survivors without EVA or 0.876 with EVA. 

(11) Use sheet “Optimal” of Chapter 3 &4.xls to find the upper bound on target 
value killed for the problem specified in Exercise 4, as well as the myopic so- 
lution of a firing problem where information feedback is of the SLS type. This 
will require inputting a 10x8 matrix, with the last row being the values of the 
eight targets (false targets need not be input, since the allocations to them 
should clearly be 0). DO NOT press the “Compute Optimal” button, lest you 
hang up your computer on this large problem. Instead, press the “Compute 
Myopic and Upper” button to find the objective function in the two cases. 
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Ans: The myopic value is 446.88 and the upper bound is 450. Although the 
optimal firing policy for the shoot- look- shoot case is still unknown, the re- 
sults are tightly constrained on both ends. The upper bound applies regard- 
less of the type of feedback. 




Chapter 4 
Target Defense 



Circle the wagons! 
John Wayne 



4.1 Introduction 



In this chapter we consider problems where multiple attackers are approaching a 
target, hoping to kill it by overwhelming its defenses. The defense is armed with 
some anti-attacker weapons called interceptors, each of which can kill only the at- 
tacker to which it is assigned. The goal is to use the interceptors to maximize the 
survival probability of the target. Instead of assigning weapons to nondescript tar- 
gets, as in Chapters 2 and 3, the defense now assigns them to attackers. The “tar- 
gets” are now the things being defended. 

One can also imagine situations where a target is to be defended by setting up 
screens or barriers that are not exhausted by a series of attacks. A convoy of ships 
might have a screen of destroyers designed to protect it from submarines, for ex- 
ample, or an important political leader might have a screen of agents designed to 
protect him from attack. Although such problems are well described as “target de- 
fense,” they are analytically similar to Search theory problems, for which the 
reader is referred to Chapters 7 and 10. Problems in this chapter are characterized 
by having some kind of a resource that is gradually exhausted as attackers are en- 
gaged. 

Most of the methods outlined below have been developed for situations where 
the defenders are anti-ballistic missiles (ABMs) and the attackers are either air- 
craft or missiles, particularly ICBMs, but there are also applications in other cir- 
cumstances. Navy ships, for example, are particularly vulnerable to torpedoes, and 
anti-torpedo-torpedoes (ATTs) have been proposed as a countermeasure. A ship 
trying to defend itself against a group of incoming torpedoes using ATTs faces a 
problem that is analytically similar to a missile silo or city trying to defend itself 
using ABMs. In either case, attackers and defenders (interceptors) are expensive, 
individually assigned objects in short supply. 

A complicating feature is that some attackers may attack the defense, rather 
than the targets directly. It is typical of air-to-ground campaigns, in fact, that the 
first step is usually to attempt to get control of the air by attacking the enemy’s air 
force. The associated analytic problems can become quite complex from the view- 
point of either the attacker or the defender and will not be covered here. In this 
chapter, the target is attacked directly. 
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4 Target Defense 



We will summarize basic results, emphasizing the important case where the 
attack size is unknown. Section 4.2 deals with a single target, section 4.3 deals 
with multiple targets, and Section 4.4 (Further Reading) discusses other results 
available in the literature. 



4.2 Defense of One Target Against Several Identical Attackers 



In this section there is only a single target, so the defender does not have the prob- 
lem of distributing his interceptors over multiple targets. However, since multiple 
interceptors can be assigned to any given attacker, there remains the question of 
how many interceptors to assign to each sequential attacker. Much depends on 
how much is known about the attack size. 



4.2.1 Known Attack Size 

Assume that each of t attackers will independently kill its target with probability 
p = l-q if not intercepted, and that the defender has s interceptors, each of which 

will independently kill an attacker with probability p, and all of which are to be 
used against the t attackers. 

We first consider the situation where all 5 interceptors must be committed in 
one salvo. This is the same situation considered earlier in Section 3.4.1, except 
that the defender’s goal is now to maximize the probability that the single target 
survives. As in Section 3.4.1, the defender should distribute the interceptors as 
evenly as possible over the attackers (Soland, 1987). Let k be the integer part of 
sit, and let r be the remainder: 

5 = kt-l-r, where 0 < r < t . (4.1) 

When the interceptors are distributed as evenly as possible, {t- r) attackers are as- 
signed k interceptors, r are assigned k+ and the probability that the target sur- 
vives is 

Q{s,t)^[\-p{\-p)' Y [l - (1 - pT' ^ . (4.2) 



Example 1: Suppose p =0.8, p = 0.5, s=l, and t = 3. Then k=2 and r = 1. Each 
of the 2 attackers that are assigned 2 interceptors will kill the target with probabil- 
ity p{\- p) = 0.2 , and the target will therefore survive both attackers with prob- 
ability 0.8^ = 0.64 , which is the first [ ] factor in (4.2). The second is 
(l-0.8(0.5)’)‘ =0.9,so 2(7,3) = 0.576. 
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The target survival probability Q{s, t) ean be approximated by permitting non- 
integer alloeations of intereeptors, sit to eaeh attaeker: 

Q{s,t) = {\-p{l-pY‘)\ (4.3) 

Equation (4.3) approximates Q{1, 3) in Example 1 by 0.595; (4.3) will in all cases 
be at least as large as (4.2). 

If sufficient time is available, shoot-look-shoot policies of the type considered 
in Section 3.4. 1 may be possible. The same dynamic programming method used in 
that section will also work here, but with a different objective function. Let 
{s, t) be the maximum target survival probability when n stages, s interceptors, 
and t attackers remain. Since the goal is now maximization, equation (3.8) is re- 
placed by 

K (s, t) = max„^^^^ £(T;_, (s- - V, T)); n > 0 . (4.4) 

With 0 stages left, the number of interceptors remaining is irrelevant because there 
is no further opportunity to use them, and the target is reduced to hoping that all t 
remaining attackers fail in their mission. The probability of this is TJ, (s', t)=\-q‘ . 
If n is greater than 0, then the assignment of x interceptors to the current salvo 
will leave Y attackers, where T is a random variable with the same properties as in 
Section 3.4.1. As in Section 3.4.1, the solution strategy is to use F^(s,t) to com- 
pute (s, t) for all relevant {s,t) pairs, until the stage index n is advanced to the 

number of interceptors initially available. An implementation for three stages is on 
sheet “DPDefender” of Chapter3&4.xls. Except for one additional input (the at- 
tacker kill probability p), sheet “DPDefender” looks and functions just like the 
sheet “DPShooter” that was employed in Section 3.4.1. See Soland (1987) for a 
more detailed derivation of (4.4). 

The number of stages or salvos is sometimes derivable from geometric con- 
siderations. One such circumstance might be called “self-defense,” where attack- 
ers are assumed to proceed at constant speed U directly toward their target, the 
marksman. The marksman launches interceptors with constant speed V at the at- 
tackers. After each engagement, it takes a certain amount of time to assess which 
attackers still survive. During that time, the surviving attackers move toward the 
marksman by a distance A, the product of U and the assessment time. If a salvo of 
interceptors is launched at range R, then the intercept will be at time R!{U + V) , 
since the relative speed is U + V , at which time the interceptors (and therefore the 
attackers) are at range RV l(U + V) . Let a be the ratio V l(U + V) . Allowing for 
attacker movement during the assessment time, we can say that that the next salvo 
of interceptors can be launched at range R'=aR — A, provided that that range is 
large enough to permit launch. See Figure 1. This argument can be repeated to find 
the launch range following R', and so on. If R, is the /th launch range, then Ri is 
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deterniined by either the maximum range of the marksman’s sensor or the maxi- 
mum interceptor range, and succeeding ranges are given by 



1-a 



(4.5) 



as can be shown by induction (Exercise 3). As soon as is too small to permit 
launch of a salvo of interceptors, n is the number of stages to be used in (4.4). 



Launch salvo 2 



Intercept by salvo 1 Launch salvo 1 







CtRi-A ctRi Ri 

Figure 1: There are 2 attackers when salvo 1 is launched, and 1 attacker 
when salvo 2 is launched. 

Example 2: The maximum range of a radar is 6000 m. A battery of anti-aircraft 
missiles is available to defend a ship against attacking surface-to-surface (SSM) 
missiles that approach at 100 m/s. The defending missile speed is 200 m/s. It takes 
2 s to determine the outcome of any engagement, and any SSM that gets within 
200 m of the ship will detonate. How many salvos are possible without risking a 
detonation? To answer, use Rj = 6000 m, a = 2/3 and A = 200 m to calculate the 
sequence of launch ranges in meters: 6000, 3800, 2333, 1356, 704, 269, -21. Al- 
though the sixth salvo can be launched before any SSM detonates, the intercept 
will not occur until range (X/?6= 179 m, which is smaller than 200 m, so only five 
salvos are actually possible. Note that the salvos will not be uniformly spaced in 
time, since the initial engagements take a long time to complete. 

The analysis leading to (4.5) is more general than that in Wagner et al. (1999, 
eh. 13) or Przemieniecki (2000, Section 6.5) in allowing for an assessment ad- 
vance A, but less general in that those two references consider geometric situations 
where the attackers do not proceed directly toward the marksman; i.e., situations 
that are not necessarily self-defense. In such situations see Section 3.4.1, since 
“survival probability” is no longer the appropriate measure of effectiveness. If the 
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attackers are aircraft, it may also be possible to shoot at them on the way out, as 
well as on the way in. 

Suppose now that the attackers arrive one by one, widely separated in time, 
and moving so fast that only one salvo is possible for each attacker. Let m, be the 
number of interceptors allocated to the ith attacker; i=\,...,n in an optimized firing 
schedule. We have argued above in justifying (4.2) that the m, should be as equal 
as possible (a “flaf ’ defense), but the reader may have intuitive feelings that a “ta- 
pered” defense where early attackers see more interceptors would be more desir- 
able. Intuitively, it would be regrettable if there were several unused interceptors 
remaining after the target is killed, since the interceptors have no other use than to 
defend the target. We might therefore expect early attackers to be hit hardest in the 
optimized schedule. In spite of intuition, it remains true that the defense should be 
flat as long as the attack size is known. 

A tapered defense can be useful when the attack size is unknown. Sections 
4.2.2, 4.2.3, and 4.2.4 deal with three versions of the problem where n is un- 
known, each of which leads to a tapered defense of some kind. 



4.2.2 Bayesian Defense 

The attackers are assumed to show up one at a time, forcing the defense to guess 
the number of successors (if any) on each occasion. Much will depend on the de- 
fense’s ability to anticipate the unknown ultimate size of the attack, N. 

The defense has a stockpile of m interceptors, each of which has kill probabil- 
ity p. These interceptors are available for the defense of a single target against a 
sequence of attackers, each of which has kill probability p if not intercepted. It is 
assumed that N does not exceed some known number n, and that the probability 
law for N is known (hence the term “Bayesian,” which implies the existence of a 
prior distribution). Let m, be the number of inceptors allocated to the ith attacker. 
The objective is to determine the firing schedule ni\, ..., m„ that maximizes the 
probability of surviving all N attackers, subject to the constraint that 
OTj 4 --- =m . The shoot-look-shoot analysis of Section 4.2.1 would apply if A 
were known, with N stages because the attackers arrive one at a time. However, 
random variable N is unknown, except for its probability law. 

We will solve the problem using dynamic programming. The state of the 
process is (s', i), where s is the number of interceptors remaining and i is the num- 
ber of attackers that have already arrived and failed to kill the target. The decision 
required is to determine the number of interceptors x to use against the next at- 
tacker. The i part of the state is important because if i = n — 1, for example, then it 
is clear that s is the best choice for x, whereas it might otherwise be wise to re- 
serve some interceptors against the possibility of additional attackers. The objec- 
tive function is F{s, i ) , the maximum survival probability of the defended target. 

To develop a recursive formula for F{s,i) , we must first recognize that there 
may not be any future attackers at all, in which case survival is certain. The prob- 
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ability that there will be at least one more attacker, given that i attackers have al- 
ready arrived, is Q, = P(N > i\N >i) . Since the probability law for N is known, 
so are these “continuation probabilities” Qi. If there is at least one more attacker, 
which happens with probability Qi, and if the next attacker does not destroy the 
target, then the next state will be -x,i + ^) ■ The desired recursion is thus 

F{s,i) = \-Q,+ Qi max|[l - p{\- pY^^F {s - + . (4.6) 

The expression in [ ] is the probability that the next attacker does not destroy the 
target. 

It is clear thatF(^,«) = 1 for all s, since survival is certain if all attackers have 
already arrived and failed to kill the target. Equation (4.6) can therefore be used to 
compute E’(^, n - 1) for all s<m, then F(^, n - 2) for all s<m, etc., until fi- 
nally F{s, 0) is obtained. F{m, 0) is then the best overall survival probability. In the 
process of doing the computations, the optimal allocation of interceptors can be 
recorded as the optimal firing schedule. See Exercise 6. 



4.2.3 The Maximum Cost Defense Against a Shoot-Look-Shoot 
Attack 

We assume here, as in Section 4.2.2, that the number of attackers is unknown, and 
that a firing schedule for the defensive interceptors must be set up for use as at- 
tackers arrive one at a time. However, no probability distribution is given for the 
total number of attackers. Instead, the defense takes the point of view that any tar- 
get defended by a finite stockpile of interceptors can be killed if sufficiently many 
attackers are committed, and that the proper goal is therefore to maximize the cost 
(measured in attackers) of killing the target. If this number turns out to be so large 
that the attack does not take place, then so much the better, but in any case the de- 
fensive goal is to make the target as hard to kill as possible. The same defensive 
tactic will also delay any possible kill, thus allowing time for other defensive 
measures. The attacker is assumed to have a shoot- look- shoot capability, so there 
will be no wasted attackers. The defense’s goal is to make sure there are no cheap 
victories. 

The objective of maximizing the average number of attackers required to kill 
the target can be accomplished by once again using dynamic programming, but 
the state description is simpler than in Section 4.2.2. The state of the process is 
simply the number of interceptors m remaining, and the objective function is 
c(m) = “the average number of additional attackers required to kill the target if m 
interceptors remain.” Suppose x interceptors are allocated to the next attacker. The 
probability that the next attacker kills the target is then p{\ - pf , where p and p 
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are the kill probabilities of attackers and interceptors, respectively. If the next at- 
tacker fails to kill the target, then the next state will he m-x. Therefore, since at 
least one attacker is required in any case, 

c{m) = 1+ max j(l-/?(l-/7)"')c(OT-v)| . (4.7) 

0<x<m I ^ J 



If m = 0, (4.6) requires c(0) = l + (l-/?)c(0) , which has the solution c(0) = l/p. 
This is the average number of attackers required to kill an undefended target. For 
m> 0, the option v = 0 can safely be ignored, since at least one interceptor should 
be used in any case. Equation (4.7) can therefore be used to determine c(l), then 
c(2), etc., recording the maximizing value of x at each stage (call it x (in)). Sheet 
“MaxCosf’ of Chapter3&4.xls is an implementation. 

Example 3: Suppose p = 0.8 and /?= 0.5. Then c(0) = 1.25, and (the maximizing 
element is underlined) 

c(l) = l+0.6c(0) = 1.75, and/(l) = l 

c(2) = 1 + max{0 .6c(l) .0.8c(0)l = 2.05, and v*(2) = 1 

c(3) = 1 + max{0.6c(2), 0 .8cm. 0.9c(0)} = 2.40, and/(3) = 2 

Continuing in this manner, we find that c(m) = 1.25, 1.75, 2.05, 2.40, 2.64, 2.92, 
3.16, 3.38, 3.63, 3.84, 4.04, 4.27 for m = 0, 1,..., 11, and also v*(m) = 
0, 1 , 1, 2, 2, 2, 3, 3, 3, 3, 3, 3. If 11 interceptors remain, three should be used 
against the first attacker, then v (8) = 3 should be used against the second, v (5) = 
2 against the third, v (3) = 2 against the fourth, andv (1) = 1 against the fifth. The 
sixth and subsequent attackers would not be opposed, since the defenses are ex- 
hausted. 

The function c{m) is not tactically necessary, even though it is the focus of 
optimization - it suffices to remember the function x*{m). One might, however, 
use c(m) as a measure of effectiveness for making an interceptor quantity versus 
quality decision (see Exercise 7). 

The philosophy of conducting the defense to maximize the cost of killing a 
target sees occasional use in more general combat models. Ravid (1989) applies it 
to arranging anti-aircraft defenses, as outlined in the next example. 

Example 4. Suppose there are two methods of defending an airfield. The “Near” 
defense arranges the defending guns so that they are closely spaced and near the 
airfield. Any attacking aircraft will have to pass near at least one of the guns, so 
the chances of shooting down the aircraft are relatively high (0.2). The trouble 
with this defense is that it acts after the attack, so the bombs will strike the airfield 
even if the aircraft is shot down. The “Far” defense arranges the guns far enough 
away that any attacking aircraft must survive in order to drop its bombs. The trou- 
ble with this defense is that the chances of shooting down the aircraft are relatively 
low (0.1). The problem is to choose one of the two defense types. 

The Far defense would be optimal in a situation where the fate of the aircraft 
is irrelevant, and the only object is to protect the airfield against the next attack. 
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However, the usual context of attacks on airfields is that any airfield can be at- 
tacked if the attacker is willing to pay the price in terms of aircraft lost, in which 
case the defense should have the goal of making successful attacks as expensive as 
possible. The cost of a successful attack against the Near defense is 0.2 aircraft, on 
the average. Over its lifetime, an aircraft against the Far defense will make an av- 
erage of 1/0.1 attacks (the mean of a geometric random variable), but the last one 
will not be successful, so there are only nine successful attacks per aircraft lost, 
and the cost of a successful attack is therefore 1/9. The best defense is revealed to 
be the Near defense, even though it stands no chance of preventing the immediate 
attack. 



4.2.4 Prim-Read Defense 

The assumptions in this section are the same as in Section 4.2.3, except that the at- 
tackers are no longer assumed to have a shoot-look-shoot capability. The attackers 
still arrive sequentially, but a certain number (say n) out of a large stockpile must 
be irrevocably committed to the target. Let p{n) be the probability that the target is 
killed by one of n attackers, and let 2. = max /?(«)/«. /I is the largest possible kill 

n>\ 

probability per attacker. The objective of a Prim-Read defense is to make X as 
small as possible with whatever interceptor stockpile is available to the defense, 
the idea being to prevent cheap kills. The idea was first proposed by Prim and 
Read as a method for defending targets against ICBM attack (Read, 1958; Burr 
etal., 1985). 

The problem of minimizing the defensive stockpile required to achieve a 
given X turns out to be much easier analytically than the problem of minimizing X 
for a given stockpile, so much so that a problem of the latter type is most easily 
solved by guessing values for X until the calculated stockpile is whatever happens 
to be available. This technique is illustrated below. 

Let nil be the number of interceptors allocated to the /th attacker. Making the 
usual independence assumptions, and letting p and p be the kill probabilities of at- 
tackers and interceptors, 

/7(«) = l-fj(l-/?(l-/7)'”'); «>1, (4.8) 

/=1 



and the central problem is to minimize subject to the constraints that 

1=1 

p{n) < Xn for all n > 1 . 

Suppose, for example, that p = 0.8, /?= 0.5, and that there are m = 11 inter- 
ceptors available. Our initial guess might be that 1 1 interceptors should be suffi- 
cient to guarantee that the kill probability per attacker will not exceed (say) 
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X = 0.3. We now consider the problem of minimizing the number of interceptors 
required to guarantee that the maximum kill probability per attacker does not ex- 
ceed 0.3, hoping that the answer is 11. From (4.8), /?(!) = p{\.~ pY' . Since p{\) 
must not exceed 0.3, the smallest possible value for m\ is 2, so we take m\ = 2. We 
therefore have p{2) = 1-0.8(1-/?(1-/?)'"^) . The smallest value of m 2 for which 
p{2)< 0.6 is 1, so we take m 2 = 1. We therefore have 

p{3) = 1-(0.8)(0.6)(1-/?(1 -/?)”’ ) . The smallest value of m3 for which p{l>) < 0.9 
is 1 (p(3) = 0.904 when m3 = 0, which is just barely too large), so we take m3 = 1. 
We can take m, = 0 for i > 4, since the return per attacker cannot exceed 0.25 if 
there are four attackers. The total number of interceptors required to guarantee that 
the kill probability per attacker does not exceed 0.3 is therefore 2 + 1 + 1=4. 
Eleven interceptors are evidently sufficient for a smaller value of /I than the as- 
sumed 0.3. The next step is to guess a smaller value (see Exercise 9) and repeat 
the above calculations. The calculations are easy because the product in (4.8) can 
be formed sequentially, with the first (n - 1) factors being known when m„ is be- 
ing determined. See sheet “PrimRead” of Chapter3&4.xls for a spreadsheet im- 
plementation. The simplicity of the calculations makes up for the fact that they 
must typically be repeated several times. 

Although a Prim-Read defense can certainly be constmcted for a single tar- 
get, the technique is more naturally applied to a group of targets, using the same 
value of A. for every target in the group. If the targets differ in value from each 
other, one simply introduces a target value into (4.8), and A. becomes “maximum 
value killed per attacker.” 

An implicit assumption in setting up a Prim-Read defense is that the attacker 
can determine the defensive firing schedule before making his own allocations. 
There may be good physical reasons for assuming this, as in the ABM problem 
originally considered by Read. However, it may also be true that the attacker has 
just as much trouble ascertaining defensive allocations as vice versa. In the latter 
case, a Prim-Read defense is probably a mistake. The Prim-Read defense of sev- 
eral identical targets would treat all targets equally, for example, whereas the best 
defense may be to abandon half of the targets in order to construct a strong de- 
fense of the remainder, as in Blotto games (Section 6.2.3). 
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The problem of defending a collection of targets from ICBM attack has been 
much studied. There was already enough literature in 1970 to prompt a review 
(Matlin, 1970), and research has continued in the ensuing years as command and 
control improvements have permitted increasingly ambitious planning tools. The 
last three chapters of Eckler and Burr (1972) are devoted to the topic and are a 
good summary of work up until that time. The more recent book by Przemieniecki 
(2000) includes relevant chapters titled “Strategic Defense” and “Theater Missile 
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Defense.” Here we review the basic facts of ICBM technology, and the possible 
applicability of the models introduced earlier. 




ftoeniry V«hlde 



Midcours© Phase 



Terminal Phase 



Boost Defense Segment 



Midcourse.Defenl 



Terminal Defense Segment 



Kinttic En«rgy Airborne L^er 
lnldttoptor&’ 



Aegis Ballistic 
Missile 
Defense 



Ground'Bissd 
Mid course 

Defense 



Multiple Kill 
Vehicles 



PATRIOT Terniinel 

Advanced High Altitude 

Capability^ Area Defense 



Figure 2: United States ballistic missile defense systems current in 2007 
(Source: Ballistic Missile Defeuse Orgauizatiou). 



The active life of an ICBM is generally supposed to have three phases, as il- 
lustrated in Figure 2: 

1. Boost phase. The ICBM is vulnerable during this phase because it is 
emitting lots of heat, and a choice target because killing it will also kill 
all of the re-entry vehicles that it carries. Unfortunately, boost phase oc- 
curs with little warning and does not last long. Kills are therefore hard to 
achieve. 

2. Mid-course phase. This phase is outside the atmosphere, which means 
that light but effective decoys can be deployed. Most of the ICBM’s ac- 
tive time is spent in this phase. The Brilliant Pebbles part of the Strategic 
Defense Initiative (SDI) functions here. 

3. Terminal phase. Once the ICBM payload re-enters the atmosphere, any 
light decoys burn up or slow down, exposing the heavier re-entry vehi- 
cles. Terminal phase is brief, but the compensation for the defense is that 
the targets are clear and the distances to be traveled are relatively short. 
Defense is usually conducted with relatively small, fast missiles such as 
those launched by Patriot. 



The three ICBM phases may last on the order of half an hour in total. Defen- 
sive systems are also sometimes arranged in phases or “layers,” with separate de- 
fensive systems for each phase. 

It is nearly always assumed that each side is aware of the total forces available 
to the other. If the defender knows in addition how the attackers are distributed over 
the targets, then his defense of each target might follow the lines of Section 4.2.1. 
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Kooharian et al. (1969) examine a generalization where the targets differ from 
eaeh other. 

If the defender does not know the distribution of attaekers over targets, then 
the situation ean quiekly beeome eomplieated. Seetions 4.2.2 through 4.2.4 are 
eaeh different methods of defending individual targets without having speeifie in- 
formation about how many attaekers to expeet. 

The Prim-Read defense ean be viewed as a eonservative reaetion to a situa- 
tion where the defenses must be arranged in a manner visible to the attaeker, as 
may be true of terminal intereeptors that must be physieally loeated near their de- 
fended targets. The analysis of Seetion 4.2.4 generalizes easily to the defense of 
several targets, even if they have different values, as long as the attaekers are all 
identieal. Washburn (2005) deseribes a kind of Prim-Read defense for a generali- 
zation where the attaekers are indistinguishable to the defense, but nonetheless of 
mixed types, ineluding deeoys. 

The attaeker may have the option of extending the attaek in time, in whieh 
ease the defender may be eonfronted with situations where it is feasible to destroy 
a given attaeker, but not neeessarily wise to do so. For example, suppose there are 
10 targets, 20 attaekers, and a defense with 10 perfeet intereeptors and the eapabil- 
ity of determining the target to whieh eaeh attaeker is eommitted. The defense 
might employ the prineiple that no target will be killed as long as the eapability to 
intereept attaekers remains, but doing so may not be wise. If that prineiple is em- 
ployed, the attaeker ean engage all 10 targets by first exhausting the defenders and 
then aiming one attaeker at eaeh of the undefended targets. The defender eould 
save several targets by defending only a subset, ignoring any attaekers devoted to 
targets outside the subset (the Blotto games of Seetion 6.2.3 are a formalization of 
this idea). The superiority of this taetie is sometimes reeognized when the targets 
are things like ICBM silos, but it is hard to imagine not defending a speeifie eity 
beeause it does not happen to be in some arbitrary subset. Analyses sometimes as- 
sume implieitly that the prineiple will be employed. Miereourt and Soland (1971), 
for example, assume that area defenses ean be exhausted through the saerifiee of a 
predietable number of attaekers. It should not be forgotten that the defense may 
have better options than employing the prineiple. 

It eould be that neither attaeker nor defender is able to prediet the aetions of 
the other, in whieh ease a two-person zero-sum game results, possibly a Blotto 
game. Chapter 6 ineludes a diseussion of defense problems viewed in this manner. 

Defensive problems become especially complex when the targets, the inter- 
ceptors or the attackers are of diverse types. Recent attempts to deal with this 
complexity are by Bertsekas et al. (2000), who apply an approximate kind of dy- 
namic programming that has the virtue of suffering less from Bellman’s curse of 
dimensionality, and by Brown et al. (2005), who solve a two-sided model using a 
large mixed-integer linear program. The latter is made the basis of a case study in 
Chapter 6. 
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Exercises 

(1) The defender has 13 interceptors and wishes to conduct a defense that will 
maximize the cost of killing a target by attackers whose probability of killing 
the target is 1, if not intercepted. The interceptor kill probability is 0.5, and 
all interceptors for each attacker must be employed in a single salvo. What is 
the maximized cost, and how many interceptors should be assigned to the 
first, second, and third attackers? Use sheet “MaxCosf’ to find the answer. 
Ans. The maximized cost is 4.28 attackers. The first three attackers should 
be assigned 4, 3, and 3 defenders, respectively. 

(2) The attacker kill probability is 0.7 and the defender kill probability is 0.6. 
The defense wishes to conduct a Prim-Read defense for which the maximum 
kill probability per attacker is 0.2. How many defenders are needed? 

Ans. 5. 

(3) Begin with the observation that R' = aR — A and show by induction that for- 
mula (4.5) is correct. 

(4) The maximum range of a ship’s sonar for detecting incoming torpedoes is 
1950 m. A battery of anti-torpedo-torpedoes (ATTs) is available to defend 
the ship against attacking torpedoes that approach at a speed of lOm/s. The 
ATT speed is 5 m/s. It takes 10 s to determine the outcome of any engage- 
ment, and any torpedo that gets within 2 m of the ship will detonate. 

(a) How many salvos are possible without risking a detonation? 

Ans. The sequence of launch ranges is 1950, 600, 150, 0. The launch at 150 
will intercept at range 50, which is larger than 2, so three salvos are possible. 

(b) Suppose that the ship has eight ATTs and that four torpedoes attack. 
Each torpedo will kill the ship with probability 0.5, if not intercepted. Each 
ATT will kill its torpedo target with probability 0.8, regardless of salvo in- 
dex. If the ATTs are employed optimally, what is the ship’s survival prob- 
ability? Ans. 0.9945. Use sheet “DPDefender,” and remember that a DP 
analysis for 12 ATTs and seven torpedoes will also solve all smaller prob- 
lems in the process. 

(5) The “DPDefender” sheet of Chapter 3 &4.xls implements (4.4) for up to 7 at- 
tackers, 12 defenders, and 3 stages, with the p and p parameters of Section 4.1 
being inputs (there is a separate p for each stage). The optimized function 

{s, t) is shown in yellow cells, and the optimizing allocation of defenders 

to targets {s, t) is shown in green cells. Solve several problems and state 
whether you think each of the following propositions is true or false when all 
stages have the same p\ 

(a) For fixed x^(s,t) is always a nondecreasing function of s. “The 
more you have, the more you use.” 
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(b) All attackers are treated equally; that is, (s, t) is always an integer 
multiple of t. 

(c) If S' > t, then (s, t)>t. “If resources permit, shoot at each target at 
least once.” 

(d) For fixed (n, s), (s, t) is a nondecreasing function of t. “The more 

targets there are, the more interceptors you use.” 

Also, give a qualitative description of what happens if the last stage has a 
higher p than the other 2 , as might happen if defenders were more effective 
at close range. 

( 6 ) Suppose that the number of attackers A is random, with P(A = i) = 0.1, 0.3, 
0.4, 0 .2 for i = 0,1, 2,3. The attackers, whatever their number, arrive sequen- 
tially, so that interceptors must be allocated to each attacker as it appears. 
Each interceptor kills its target with probability 0.5. Using (4.6), determine 
the firing schedule that maximizes the probability of destroying all attackers 
if there are four interceptors in total. One way to solve this problem is to 
construct a spreadsheet that solves the problem in general for at most three 
attackers. 

Ans. Set p=l, since this makes the probability of survival equal the probabil- 
ity of killing all attackers. Then iwi = 2, m 2 = 2, m 3 = 0. The probability of 
destroying all attackers is F’(4,0) = 0.55 . Using all four interceptors on the 
first two attackers leaves the defender helpless against a possible third at- 
tacker, but doing so is nonetheless optimal. Intuitively, the chance of having 
three attackers is so low that it is not wise to protect against the possibility. 
One possible spreadsheet solution is sheet “Exercise 6 ” of Chapter3&4.xls. 

(7) Section 4.2.3 includes an example where /?= 0.5, p^ 0.8, and m= 11. 
Would the defender prefer 4 perfect (/>= 1) interceptors to 11 imperfect 
(p= 0.5) ones 

a) Using the measure of effectiveness c(m)? 

b) Using the measure of effectiveness pm, the “average number of 
kills?” 

Ans. The defense prefers perfect interceptors in (a), but not in (b). Note the 
high emphasis that c(m) places on interceptor quality. 

( 8 ) Using a modified version of (4.7), compute c( 8 ) under the assumptions that 
/i = 0.8, p^O.5, and that at most two interceptors can be allocated to any at- 
tacker. Sheet “MaxCost” cannot be used to answer this question, since it 
lacks a constraint on salvo size. 

Ans. c( 8 ) = 3.49. 

(9) Continue the sample analysis begun in Section 4.2.4 by next guessing 
/I = 0.15. You should find that the required firing schedule utilizes 11 
interceptors. Compare your calculations with the firing schedule obtained in 
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Section 4.2.3 for 11 interceptors. The resources are identical in the two 
analyses, but the optimization criterion differs. 

(10) Suppose you must locate anti-aircraft defenses around a depot that may be 
attacked by enemy aircraft. The Near defense will always shoot down the at- 
tacking aircraft, but only after the aircraft has already dropped its bombs. In 
the Far defense, the probability of killing an aircraft is only 2/3, but the ad- 
vantage is that any aircraft killed will not be able to drop their bombs first. 
You decide to arrange the defenses to maximize the average cost, in terms of 
enemy aircraft, of dropping a load of bombs. 

(a) What do you think of the criterion? 

(b) Which is better, the Near defense or the Far defense? 

Ans. The cost of a successful attack is 1 with the Near defense or 2 with the 
Far defense, so the Far defense is better. See Example 4. 




Chapter 5 
Attrition Models 



Another such victory over the 
Romans, and we are undone. 

Pyrrhus 



5.1 Introduction 



Much of combat is about the destruction of enemy forces. The models in this 
chapter ignore all other aspects, including important aspects such as morale and 
maneuver. Victory is a matter of outlasting the enemy. 

Most attrition models are descriptive in the sense that they are not built to op- 
timize any particular tactical decision. The model simply describes how the num- 
bers of the various forces involved will fluctuate with time, generally decreasing 
until some battle termination criterion is met. The purpose of the model may be to 
see how the evolution of the battle depends on the initial numbers and lethality of 
the various weapon systems involved, or to plan logistics or medical support. 

At their simplest, aggregated attrition models can be easily implemented in 
spreadsheets, as will be shown by example in Chapter5.xls. More detailed high- 
resolution models may be tailored to specific terrain, recognize whether there is 
actually a line of sight from shooter to target, incorporate weather and time-of-day 
effects, and take hours to run on powerful computers. In this chapter we deal 
mainly with aggregated models that ignore most of these details. 

An early application of attrition models was to air combat between fighters in 
World War One, and Hughes (2000) states that attrition models are particularly 
applicable to naval combat. Nonetheless, the main applications have historically 
been to land combat. 



5.2 Deterministic Lanchester Models 



Definition: A deterministic Lanchester model is a mathematical model composed 
of a system of ordinary differential equations (ODE). The ODE involve state vari- 
ables that represent the number of entities of various types present in a battle, and 
each of the ODE expresses the time rate of change of one of these variables as a 
function of the others. 

Lanchester (1916) applied ODE to situations where mutual attrition occurs 
continuously in time. He was inspired by air combat between fighters in World 
War One, and particularly wanted to investigate the importance of concentration 
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of forces. This class of ODE-based combat models is named after him, even 
though Ossipov (Helmbold and Rehm, 1995) is equally deserving. 

The general idea in a Lanchester model is to define some state variables rep- 
resenting force levels, determine the ODE for the rate of change of each of these 
variables, and then solve the resulting equations as a function of time. In the case 
of air combat, let the state variables x{t) and y{t) be the number of surviving blue 
and red aircraft at time t on each of two sides, and then hypothesize that the rate of 
change of each variable is proportional to the other. Symbolically, we have 

= -ay(t); x(t) > 0 and = -bx(t); y(t) > 0 . (5.1) 

dt dt 

The proportionality constant a represents the number of blue aircraft shot down by 
each surviving red aircraft per unit of time, and similarly for b with the roles re- 
versed. Each equation holds only when its state variable (the one whose time de- 
rivative is shown on the left-hand side) is positive; once the variable reaches 0, it 
ceases to decrease because the other side has nothing left to shoot at. 

Given initial values for all state variables at time 0, a set of ODE determines 
the state variables for all future time. Sometimes (as in the case of (5.1)) the equa- 
tions can be solved analytically, but in general a numerical solution will be neces- 
sary. Although there are more efficient methods, the only numerical method to be 
used here is Euler’s method where time is made discrete and the differential equa- 
tions are replaced by difference equations. If the time increment is A, the differ- 
ence equation counterpart to (5.1) is 

x(t-l- A) = x(t)-aAy(t) andy(t-l- A) = y(t)-bAx(t). (5.2) 

Since the initial values x(0) and y(0) are given, (5.2) determines x(A) and y(A) by 
substituting into the right-hand side. Withx(A) andy(A) known, (5.2) determines 
x(2A) and y(2A), and so on until a termination condition is reached. The termina- 
tion condition might be that a certain amount of time has expired or that the force 
ratio has increased or decreased to the point where one side or the other would 
surrender. If any computation produces a negative value for a state variable, then 
the state variable is instead taken to be 0. The theory of ODE guarantees that (at 
least for all of the ODE considered here), in the limit as A approaches 0, the dif- 
ference equation solution approaches the true solution of the ODE. 

The reader may wish to experiment with sheet “SquareLaw” of Chapter5.xls, 
which displays both numerical and analytic solutions of (5.1) (the analytic solution 
is given in Exercise 14). The analytic solution is actually the solution of (5.1) 
without the positivity restriction, so it must be used with care. When (say) red be- 
comes negative, the analytic formula for the number of surviving blues will actu- 
ally increase with time, as if red were somehow “unshooting” at blue. The nu- 
merical solution avoids this unwelcome possibility by replacing any tentative 
negative number of survivors with 0. 
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Although the time increment A in (5.2) is meant to be an arbitrary small num- 
ber, sometimes a more thoughtful choice may lead to a more valid representation 
of reality. This is particularly true if combat is conducted in a sequence of power- 
ful discrete ahacks, as at theWorld War Two battle of Midway where strikes were 
conducted by waves of carrier-based attackers. Hughes, for example, has em- 
ployed his “salvo equations” extensively in evaluating naval combat (see his chap- 
ter in Bracken et al. (1995), or Lucas and McGunnigle (2003)). In that case A 
represents the length of a combat cycle, and (5.2) represents the idea that the 
strikes in each cycle are simultaneous - the salvos cross in midair. 

If red strikes blue first in each cycle of combat, it may be best to base the up- 
date fory(t-l- A) in (5.2) on x(t + A), rather than x{t). This would make little differ- 
ence in solving ODE when A is small, but could be a substantial advantage to red 
ifx(t + A) is significantly smaller than x(t). Similarly, if blue strikes red first, the 
update for x(t-l-A) might be based ony(t-l-A), rather thany(t). Exercise 4 shows that 
the order of making updates can be significant. 

Since all state variables are functions of time (t), and since all derivatives are 
with respect to time, it is customary to write ODE like (5.1) in abbreviated form 
where the time argument is omihed on the right-hand side, as we will do in the 
rest of this chapter. 

When ODE involve only two state variables, it may be useful to eliminate 
time by taking the ratio of derivatives. The result in the case of (5.1) is 

— = — ; X > 0 andy > 0. (5.3) 

dx ay 

By separating variables, we discover that the solution of (5.3) is 

(h/a)x^ + (constant), as can be verified by differentiation. Since this must also 
hold at time 0, when x and y are known to be (say) xq andyo, respectively, the con- 
stant of integration must be such that 

a{y^-yo^) = b(x"-x,"). (5.4) 

Equation (5.4) is a relation between x and y that holds throughout the period where 
both state variables are nonnegative and can be used to predict which side will win 
a bahle that ends when one side is reduced to 0. If the “fighting strengths” of the 
two sides are defined to be bx^^ and ayo^, respectively, then whoever has the larger 
fighting strength will win such a battle, and (5.4) can be used to predict the num- 
bers of survivors. Inserting 0 for x in (5.4) and solving fory, for example, we find 
that ay^ = ay^^ ~bx^ , which produces a positive number of red survivors (y) as 
long as red has the larger fighting strength. The dependence of the fighting 
strength on the square of the number of units is responsible for the term “square 
law” in referring to (5.1). 



82 



5 Attrition Models 



Example 1: Suppose Xq = 100, yo = 200, a = 0.03, and b = 0.01. The fighting 
strengths of blue and red are 300 and 400, respeetively, so red will win. Figure 1 
shows the eourse of the battle by plotting y versus x as determined by (5.4). Not 
only does red win, but he has half of his initial foree left when blue is eliminated. 
See sheet “RedBlue” of Chapter5.xls for variations. 




Figure 1: The course of a square law battle with time elimiuated. 



In Example 1, blue loses in spite of the faet that he is only outnumbered by a 
faetor of 2, while eaeh of his units is three times as lethal as one of red’s. The rea- 
son for this is that fighting strength is proportional to the square of the initial 
number, but only to the first power of the lethality eoeffieient. A small foree of 
exeellent units may very well lose to a larger foree of eonventional units. This 
emphasis on numbers has a simple intuitive explanation. Intuitively, there are two 
reasons for blue unit Tom to desire an additional friendly unit Diek in a square law 
battle. One reason is that Diek will shoot at the enemy Harry, perhaps killing 
Harry before Harry ean kill Tom. The other reason is that Harry may shoot at Diek 
instead of Tom; that is, the additional unit will dilute the enemy’s fire. An inerease 
in the lethality eoeffieient fulfills the first funetion, but not the seeond, so (given a 
ehoiee) it is better to inerease the initial number of units. If the square law does 
indeed represent eombat, this is an important observation about the kind of forees 
that ought to be fielded. 
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The square law is sometimes referred to as “aimed fire” beeause the rate at 
whieh blue (say) loses units has nothing to do with x, the number of blue units. It 
is as if red units never had to seareh for blue targets to shoot at. If blue largels 
were hard to find or if red were simply firing into an area without aiming at any 
speeifie blue unit, we would expeet blue losses to be proportional to x, as well as 
y. The resultant “unaimed fire” ODE are 



dx(t) 

dt 



—axy and 



dy(t) 

dt 



-bxy. 



(5.5) 



It is not neeessary to restriet (5.5) to nonnegative values ofv andy - negative 
values are avoided automatieally beeause eaeh time derivative beeomes 0 when its 
state variable beeomes 0. As in the ease of the square law, there are eorresponding 
differenee equations that ean be used to solve (5.5) numerieally (see Exereise 2). 

The parameters a and b do not have the same meaning as in the square law, so 
it does not make sense to eompare solutions of (5.4) and (5.5) with the intention of 
determining whether aimed or unaimed fire is “better” for one side or the other. 

Elimination of time in (5.5) by taking the ratio of derivatives leads to the 
equation dyldx=bla, whieh has the solution 

a{y-yo) = b{x-x„) . (5.6) 

The fighting strength is now bxo for blue or ayo for red, the produet of initial num- 
ber of eombatants and lethality in eaeh ease. The unaimed model is therefore 
sometimes ealled the “linear law,” sinee only the first power of initial number is 
involved. Initial numbers and lethality are emphasized equally in the linear law, so 
it is easier to make up for being outnumbered by inereasing unit lethality. 

The literature eontains many generalizations and different kinds of “laws” be- 
sides square and linear. Deitehman (1962), for example, eonsiders a model of 
guerilla warfare where one side (the guerillas) uses aimed fire while the other uses 
area fire. Cases intermediate between area fire and aimed fire have been eonsid- 
ered, and there is no reason why both types of fire should not be present at the 
same time, as they will be below in Seetion 5.6. There may be multiple entity types 
(infantry and artillery, for example) on eaeh side. Our next example is of this type. 

Example 2 (Battle of the Atlantie): This is a roughly realistie model of the World 
War Two U-boat war in the Atlantie. We assume that German submarines operate 
only in “wolf paeks” and that all Allied ships sail in eseorted eonvoys. Morse and 
Kimball (1950) make the approximation that 5nlc and nc/100 merehant ships and 
U-boats, respeetively, will be lost when a eonvoy defended by c eseorts is attaeked 
by n submarines. These approximations were arrived at statistieally, after examin- 
ing the results of previous eonvoy eneounters. We retain that assumption. Sinee 
the exehange ratio of U-boats lost to eseorts lost in eonvoy engagements was 
about 5/1, we will also assume that the average number of eseorts lost is nc/500. 
Other numbers assumed below were typieal of the time. Let 
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M = cumulative merchant ships sunk (state variable) 

S = remaining submarines (state variable) 

E = remaining escorts (state variable) 

Pc = probability that a convoy is attacked by a given wolf pack = 0.0 1 

tc = time a convoy must be escorted (both directions) = 30 days 

Tc = cycle time for an escort from one convoy to the next = 50 days 

fc = fraction of escorts escorting convoys = tJTc = 0.6 

ts = patrol time for a submarine = 20 days 

Ts = cycle time for a submarine from one patrol to the next = 50 days 

fs = fraction of submarines on patrol = tJTs = 0.4 

r = rate at which convoys leave = 1/day 

m = convoy size = 40 

n = wolf pack size = 4 

= probability of sub loss per cycle due to other causes other than 
convoy engagements = 0.04 

Rs = rate of replacement of submarines (new construction) = 0.7/day 

Re = rate of replacement of escorts (new construction) =0.5/day 

We assume that the following three formulas hold 

Sfjn= number of wolfpacks on patrol 

e = engagement rate counting engagements both ways = (convoy 

sailing rate)(pc)(wolfpacks on patrol) = {2r){p^){Sf^ / n) 
c = escorts/convoy = (i^^)/(rt^) = ii/(r7’J 



There was a debate within the German navy about whether convoys returning 
from Britain to the United States should be engaged, the argument for not doing so 
being that the returning ships were empty. Here we are assuming that engage- 
ments happen in both directions, hence the factor of 2 in the formula for e. Given 
all the above assumptions, the ODE for the three state variables are 



dM 

dt 

dt 

dt 



e{5nlc) = IQrXpJ^iS ! E)- M > 0, 

-e{nc im)-pXIT,+X=-S{[2 pX /(1 007; W + pj X) + Rp, S > Q, 
-e{nc / 500) + R^= -S[2 pX /(5007; )\E + Rp,E>Q. 



These ODE exhibit several new features. M starts at 0 and can only increase, 
since the cumulative losses of merchant ships can only go up. M plays no role in 
the ODE for S and E and serves only as a counter for the progress of the battle. 
Note that the rate of loss of merchant ships is proportional to the square of the sail- 
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ing rate and has nothing to do with the size of the eonvoy. If one were to double 
the eonvoy size and halve the sailing rate, merehant losses would go down sharply 
beeause there would be fewer engagements and also fewer losses per engagement. 
This was basieally the observation that led to inereased eonvoy sizes in World 
War Two. The ODE for S and E involve both positive and negative terms, sinee 
submarines and eseorts were both sunk and replaeed. 

The wolf paek size does not appear in any equation, whieh is superfieially at 
odds with histories of the battle that report devastating effeets of wolf paek intro- 
duetion. However, while the effeet of eneountering a wolfpaek was indeed devas- 
tating for the unlueky eonvoy, the effeet is exaetly balaneed in these equations by 
the eneounter rate’s inverse proportionality to wolf paek size. MeCue (1990) in- 
eludes a more in-depth analysis of this issue. 

Substituting numbers, with time in days. 



dM 

dt 



= 2SIE-,M>Q, 



dt 

dt 



-5(1.6x10 '’.E + 8xl0^") + 0.7; 5 >0, 
-3.2xl0-’£5d-0.5;.E>0. 



Figure 2 shows the solution of these equations for the initial eonditions 
M(0) = 0, 5(0) = 50, ii(0) = 100, exeept that merehant losses are divided by 10 to 
keep all eurves on the same seale. During the 2000 day period, there are 
10 X 250 = 2500 merehant ships sunk. The number of eseorts grows at approxi- 
mately 0.5/day throughout the period, sinee eseort attrition is negligible. The 
number of submarines eventually reaehes a peak and deelines, but not as eatastro- 
phieally or as fast as aetually happened. A major reason for this is that aetually 
inereased strongly with time in WWII, whereas it has been held eonstant here. 

The “Atlantie” sheet of Chapter5.xls (from whieh Figure 2 is taken) permits 
the user to experiment with different initial eonditions or parameters. A great 
many exeursions are possible; e.g., ehange pc to indieate better submarine sensors 
or ehange Re to illustrate the benefit of inereased eonstraetion eapaeity. 

The aetual Battle of the Atlantie exhibited periods of sueeess for the two sides 
as various seeret eodes were broken and teehnologieal innovations were intro- 
dueed. It also had strong geographieal influenees. Our simple Lanehester model 
misses all of this interesting detail, but it does make one thing elear: as long as 
eonvoy engagements are well approximated by the Morse and Kimball equations 
for losses in eonvoy engagements, the submarines are doomed in the long run be- 
eause they eannot sink eseorts fast enough. 

Lanehester models ean have hundreds of state variables, as in Seetion 5.6 be- 
low. They ean also have embedded optimization problems - Should blue’s artil- 
lery shoot at red’s infantry or at red’s artillery? Taylor (1983) is an exeellent in- 
troduetion to the rieh seleetion of possibilities that the ODE paradigm makes 
possible. Sinee our intention here is not to be exhaustive, we pass on to the eon- 
sideration of the stoehastie version. 
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Figure 2: The Battle of the Atlantic. Merchant ships sunk are divided hy 10. 



5.3 Stochastic Lanchester Models 

In solving Lanchester’ s ODE using Euler’s method (Equation 5.2), the time in- 
crement A is ideally a small number. If A is small, the number of casualties in one 
time increment may be similarly small, say 0.1. What does it mean to have 0.1 
casualties? It seems illogical to model something that has to be an integer with a 
real number that could very well be fractional, as is the case when ODE are em- 
ployed. There may be no harm in the practice if the forces engaged are sufficiently 
large, but how are we to know what size is “sufficiently large” unless we are able 
to model the battle with integers in the first place? In fact, why is it that something 
as laden with chance events as a battle would be modeled with any kind of a de- 
terministic model? All of these questions lead to the employment of stochastic 
models. 

One of the charms of modeling battles with Lanchester’ s ODE is that they 
permit a stochastic interpretation that is responsive to criticisms such as those ex- 
pressed above. Letting x = (vi,...,x,t) stand for a vector of k state variables, the 
general form of ODE is 




(5.7) 



Most of the examples considered so far have had k = 2, but that restriction is not 
essential. The deterministic interpretation is that /^(x) is the rate at which x, de- 
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creases. In contrast, the stochastic interpretation is that x, decreases in a small time 
A by a single unit, if at all, with the probability of that event being A/^(x). To state 
the stochastic interpretation more precisely, is taken to be the transition rate 
of a continuous time Markov chain (Ross, 2000). 

The same functions underlie either interpretation, so the stochastic interpreta- 
tion requires no new data. Since the same data underlie either interpretation, we 
use the term “Lanchester model” to refer to either one. In the stochastic version, 
the average losses to x,- in time A are 1(A/J (x)) + 0(1 - A/J (x)) = A/] (x) , which 

agrees with the deterministic version. We can therefore see that the deterministic 
version is obtained from the stochastic version by employing EVA, as described in 
Chapter 1 . In fact, comparing the two versions is a good opportunity to study the 
effects of EVA. 

Example 3: Consider a square law battle (Section 5.2) where xo= 10, yo=8, 
a = 0.9, and b = 0.8. The deterministic theory predicts that blue will always win 
with about 5.29 survivors. Under the stochastic interpretation, red wins about one 
fourth of the time, generally when he gets lucky early. This can be demonstrated 
in a Monte Carlo simulation (Appendix C) where time is repetitively advanced by 
A until one side or the other is wiped out, with survivors being decremented in 
each step based on whatever random numbers are generated. See sheet “Square- 
LawRnd” of Chapter5.xls for a demonstration. If you use SimSheet.xls (Appendix 
C) to perform lots of replications, you will see that the distribution of the net num- 
ber of survivors (x - y) is bimodal, with a positive mode where blue wins and a 
negative mode where red wins. Battles where both sides are nearly wiped out are 
rare. The average net number of blue survivors is about 3.4. 

Example 3 shows that the deterministic and stochastic versions of a small 
Lanchester model can differ strongly. Evidently, the EVA shortcut can cause sig- 
nificant errors in small battles. 

Although Monte Carlo simulation is one way to explore the stochastic inter- 
pretation, analytic computations based on the Chapman-Kolmogorov equations 
(Morse and Kimball, 1950; Ross, 2000) are also possible. Consider any battle 
where each of two sides is represented by one state variable (k = 2). To emphasize 
the integer nature of all state variables, refer to the state as (m,n) rather than (xi, X 2 ). 
If the state is currently (m, n), then the predecessor state must have been either (w, 
n+ 1) or (m -I- 1, n), and the successor state must be either (m- 1, n) or (m, n — 1). 
Figure 3 is part of a state transition diagram showing both of the ways in which 
the state might get to (m, n), as well as both of the ways in which the state might 
leave (m, n). The transition rates between these states are known. For example, the 
rate of going from {m, n-l-1) to {m, n) is -l-l) , the rate at which the second 

state variable decreases from state (m, n-l-1). The Chapman -Kolmogorov equa- 
tions are ODE involving these rates, with the probabilities of being in the various 
states at time t as state variables. Specifically, the equation for P{m, n, t ) , the prob- 
ability of being in state {m, n) at time t, is 
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dP(m «, 0 = y- (^ + 1 , n)p(m + l,n) + (m, n + l)P(m, n + l) 

at 

-{fi (m, n) + {m, n))P{m, n, t). 

Equation (5.8) states that the probability of being in state {m, n) at time t inereases 
beeause of transitions into that state (the first two terms on the right-hand side), 
and deereases beeause of transitions out (the last term). There is a similar equation 
for every possible state. State (0,0) will never be observed beeause the battle stops 
as soon as one side or the other is redueed to 0. Therefore, if the battle starts with 
nif, blue units and «o red units, there are a total of -I- equations. Some 

of these equations omit some of the terms in (5.8) - the first term is missing from 
the equation for state (ot„, 0) , for example, sinee state (ot„ +1,0) is impossible. 

At time 0, all of the probabilities P(m, n, 0) are 0 exeept forP(m„,«„,0), 
whieh is 1.0. Sinee all probabilities are known at time 0, Euler’s method ean be 
used to find all probabilities at time A, and then at time 2A, ete. At the end of the 
implied long eomputational proeess, we will know the probability of being in 
every state at every time. Contrast this with the simpler deterministie interpreta- 
tion, where we diseover how the state of the system evolves by solving a system 
of only 2 ODE. Obviously the stoehastie interpretation leads to the need for eon- 
siderably more eomputation than the deterministie interpretation. Nonetheless, in 
spite of the eomputational diffieulty, the Chapman-Kolmogorov equations are 
solvable in problems that are not too large. 




Figure 3: Transitions into and out of state (m,n). 



Example 4: Consider a square law battle somewhat larger than the one eonsid- 
ered in example 3, with iWo=20, «o^40, a = 0.01, and Zi = 0.02. Thus 
/j(ffi,«) = O.Oln and f 2 {m,n) = 0.02m . The deterministie version has red win- 
ning at time 62.32 with 28.28 survivors. Figures 4 through 6 display the aetual 
probabilities of being in the various states at times 20, 40, and 60. The state is 
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somewhere around (14, 35) at time 20, with little ehanee of the battle being over 
by then. A t time 40, some of the probability has migrated to the m=0 axis, where 
red has already won the battle. By time 60, most of the probability is on that axis. 
By time 80, nearly all of the probability would be on that axis, with red having 
about 30 survivors. The sequenee of pietures ereates the impression of a probabil- 
ity mass that starts at (20,40) and spreads out as it makes its way over to the axis 
where m = 0, to whieh it stieks. At all times the total mass (volume under the 
bulge) is 1 . 

Onee the probability distribution P{m,n,t) is known, various derived quantities 
sueh as the expeeted number of red survivors at time t ean be easily eomputed. 




Figure 4: Probability map at time 20 for a square law battle. 




Figure 5: Probability map at time 40 for a square law battle, 
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Figure 6: Probability map at time 60 for a square law battle. 

As in the deterministic case, it may be useful to eliminate time in models 
where there are only two state variables. The crucial observation is that state 
(m, n) will eventually be succeeded by either state {m, n-1) or state (m-1, «). Ac- 
cording to Markov chain theory (Appendix A), the probability of the latter is 



Q{m,n) 



d(m,n) + fdm,n) ’ 



(5.9) 



with (1 - Q(m, «)) being the probability of the former. These probabilities can then 
be the basis of a Markov chain where time is not involved. They are particularly 
useful if the object of analysis is to evaluate a measure of effectiveness (MOE) 
that depends only on the terminal state of the battle. Such measures include “the 
number of blue survivors” and “the probability that blue wins a fight to the fin- 
ish”. Let MOE(m,n) be the expected value of this terminal measure, given that 
the state is currently (m, n). Then the following equation connects MOE{m,n) to 
corresponding values at each of its successor states: 

MOE(m,n) = Q(m, n)MOE(m - I, n) + (I - Q(m, n))MOE(m, n -1). (5.10) 



Now, MOE(m,n) is assumed to be known in terminal states. For example, if 
MOE(m,n) is the probability that blue wins a battle to the finish, then 
MOii(OT,0) = 1 andMOii(0,n) = 0 . Since MOE(m,n) is known at the terminal 
states, equation (5.10) can first be used to evaluate the function in states that lead 
only to terminal states. After sufficiently many iterations of (5.10), MOE{m,n) 
can be evaluated in any state. The computations are efficient, compared to Monte 
Carlo simulation or to first employing the Chapman-Kolmogorov equations to find 
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the state probabilities at some time large enough to guarantee battle completion 
(perhaps time 100 in Example 3). They can be easily organized in a spreadsheet. 

Example 5: Consider again Example 3, but now our object is compute the exact 
probability that blue wins, and also the average net excess of blue survivors. To 
find these quantities, we employ sheet “Endgame” of Chapter5.xls, which is set up 
to do the calculations for square law battles that proceed until one side is annihi- 
lated. We first insert 0.9 and 0.8 for a and b (or alpha and beta, as they are called 
on that sheet). We next fdl out the green border (terminal) cells of the computa- 
tional rectangle with the appropriate values. To find the probability that blue wins, 
insert 1 in the row where blue wins, and 0 in the column where red wins. Make no 
changes to the interior of the rectangle, which is composed entirely of formula 
(5.10). In those interior cells, Q(m,n) is called leftprob{m, n) because the state 
moves left when blue loses a unit (VBA code in Module 1 implements left- 
prob(m, n) as in (5.9)). The probability that blue wins from state (10, 8) turns out 
to be 0.78. To instead find the net excess of blue survivors, insert m in the row 
where blue wins and -n in the column where red wins. The net excess of blue sur- 
vivors from state (10, 8) is 3.38. These exact numbers are comparable to those 
found earlier in Example 3 by Monte Carlo simulation. 

The deterministic solution is not a good approximation of the stochastic solu- 
tion in Examples 3 and 5, since winning the battle is very much subject to chance. 
The counterparts to Figures 4, 5, and 6 would show an accumulation of probability 
on both axes, rather than only one. The deterministic version does better in Exam- 
ple 4, as we might expect because the numbers of units engaged are larger, but 
even in Example 4 the number of red survivors is significantly random, rather than 
always 28.28 as predicted deterministically (it is never 28.28, in fact, since it is 
always an integer). 

In what sense, then, is the deterministic model an approximation of the sto- 
chastic model? Based on the examples considered so far, we might offer the fol- 
lowing answer: In situations where large numbers of units are involved on both 
sides, and where one of two sides is clearly stronger than the other, the number of 
survivors in the deterministic model is nearly the expected value of the number of 
survivors in the stochastic model. This is not an entirely satisfactory answer, since 
words like “large” and “nearly” are not well defined, but at least it explains why 
the deterministic model typically forecasts a number of survivors that is not an in- 
teger. It also has the advantage of being in accord with experience. Taylor (1983, 
Section 4.16) summarizes that experience and gives several additional references. 

The statement that the deterministic version of a Lanchester model produces 
the expected value of the stochastic version is only approximately correct. The ap- 
proximation is better in some circumstances than others, but at least the error is 
quantifiable. There are other deterministic models, not of the Lanchester type, that 
produce fractional answers to problems involving discrete participants, but where 
there is no associated stochastic interpretation that can serve as a test of ultimate 
truth. Such models should be used with caution, if at all. 
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We have diseussed two different interpretations of a Lanehester model. There 
is a third interpretation that should be avoided. This is the one where Euler’s 
method is used to solve the deterministie version, but the number of survivors is 
rounded to an integer after every time step. This aehieves a eosmetie goal at the 
eost of destroying the integrity of the deterministie solution. The objeet ought to 
be to solve the ODE aeeurately, rather than to enforee an integer property that (in 
the deterministie version) is not appropriate. 



5.4 Data for Lanehester models 



Lanehester models require that the analyst know attrition eoeffieients of various 
kinds. There may be only two sueh parameters in simple eases sueh as the square 
and linear laws, but the number of sueh parameters grows fast with the number of 
state variables. There are basieally two methods that have been used to find sueh 
data, the engineering approaeh and the statistieal approaeh. The two methods are 
not mutually exelusive; the Battle of the Atlantie (Example 2) employs both of 
them. In this seetion we take a eloser look at the methods. 



5.4.1 The Engineering Approach 

In this approaeh, the unknown parameter is expressed as a funetion of other 
known or more easily measurable parameters, with the funetion depending on as- 
sumptions made about the battle. 

Consider Lanehester’s linear law, the law of “unaimed fire.” The parameter a 
is the rate at whieh blue easualties oeeur per blue unit, per red unit. To estimate a, 
we might reason that red is firing into an area of size A without knowing exaetly 
where any blue unit is loeated. If the lethal area (formula 2.3) of eaeh red round is 
L, and if there are x blue units loeated within A, then the average number of easu- 
alties per randomly fired round will be x{L/A). If eaeh red unit fires rounds at rate 
r, and if there are y red units, then the total rate of blue loss is rxy{LIA). The attri- 
tion eoeffieient a is therefore rL/A, the eoeffieient of xy, in units of inverse time. 
Parameter a has been expressed in terms of three more easily measured quantities. 

Alternatively, we might assume that red does aim at individual blue units, but 
that eaeh red unit must seareh the area A randomly to find his next target. If eaeh 
red unit searehes area at rate S, then red will find blue targets at rate xSIA. The to- 
tal red foree will therefore find blue units at the eombined rate xySIA. If the time to 
destroy a blue unit is negligible, we then have Lanehester’s linear law with a be- 
ing SI A, again in units of inverse time (this illustrates that the abstraet name “lin- 
ear law” should be preferred to the more evoeative “unaimed fire,” sinee here we 
have an example where the linear law applies where fire is aimed). If the destrue- 
tion time is not negligible, eall it r. Sinee the average time to find a blue unit is the 
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reciprocal of the rate of finding blue units, the total time for a red unit to dispose 
of a blue unit, counting both search and destruction, is T + AI{xS) . The total attri- 
tion rate is then y !{t + AI{xS)) , or xyS !{A + xlS) . This expression acts like the 
linear law when x is large (attrition is limited by the time for destruction) or the 
square law whenx is small (attrition is limited by the time for search). This is nei- 
ther the square law nor the linear law, but rather a new, more general law. What- 
ever the law is called, the attrition rate can still be quantified by measuring more 
basic parameters. Brackney (1959) explores the behavior of this model in detail. 

The engineering approach is particularly apt if the purpose of studying the 
Lanchester model in the first place is to determine the importance of certain pa- 
rameters. Should current tanks be upgraded to include an advanced target acquisi- 
tion system that will reduce the time x needed to destroy a target? We might pose 
the specific question, “Would it be better to retain the current tank inventory with 
a better (smaller) value of X or to spend the same amount of money buying more 
tanks of the current design?” This is a variation of the quality versus quantity 
question that Lanchester explored in World War I. 

Example 6: Let x be the mean time that it takes for a tank to destroy its target, and 
distinguish between hitting the target and killing it. Each hit kills the tank’s target 
with probability Firing continues until the target is finally killed, but there may 
be multiple hits before that happens (we assume that if a hit on the target does not 
kill it, there is no damage). The time for the first shot and the probability of a hit 
on the first shot are Xi and Pi, respectively. The time for a shot following a miss 
and the probability of a hit on such a shot are and P^, and the time for a shot 
following a hit and the probability of a hit on such a shot are X/, and P/,. This is a 
complicated situation with seven inputs - the sequence of hits and misses might be 
a long one before the last hit finally kills the target. Nonetheless, the mean time to 
complete the process is known. Using Barfoot (1969), the mean time required to 
destroy a target is given by the formula 



1 -?^ 



+- 



1-P. 









(5.11) 



We forego the complete derivation, but (5.11) can be regarded as the sum of three 
terms. The first is the time until the first shot arrives at the target. The second is 
the additional time until the target is first hit (possibly zero - note that the term 
vanishes if Pi^l). The third term is the additional time between the first hit and 
the last (killing) hit, again possibly 0. Assuming that the time to find the next tar- 
get is negligible, or a constant already included in Xi, the corresponding attrition 
rate coefficient is simply the reciprocal of X. Sheet “Tanks” of Chapter5.xls in- 
cludes the computation of this formula in the context of a square law battle where 
you are able to influence the seven parameters that the blue versus red attrition 
rate depends on. Given a budget, you can decide whether to spend it on quality or 
quantity. 
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5.4.2 The Statistical Approach 

The engineering approaeh has its virtues, but there are many possibilities for error. 
Reeonsider what happens when one tank fires at another. Whether the next shot 
misses or not may depend on how big the previous miss distanee was, rather than 
the simple faet of a miss, and may also depend on the distanee between tanks and 
other variables not even eonsidered in (5.11). Damage ean aeeumulate, so that 
multiple hits result in a kill even though no individual hit would do so. A tank tar- 
get ean suffer a mobility kill while still being able to shoot, although it will of 
eourse be more vulnerable onee it is immobile. Detailed, high-resolution simula- 
tions of tank battles aeknowledge all of these phenomena, but the simple model ly- 
ing behind (5.11) does not. Even if the model were fundamentally eorreet, there 
are still possibilities for error in estimating eaeh of the seven parameters that are 
involved. 

If data about tank battles is available, ineluding attrition data, it is tempting to 
use it to estimate the attrition rate eoeffieients direetly. Rather than try to explain 
how attrition happens, we ean treat the proeess as a blaek box and deal with the 
phenomenon itself This is the statistieal approaeh. It applies to all kinds of attri- 
tion proeesses, not just to tank battles. 

A frequently eited example of this approaeh is Engel’s (1954) analysis of the 
battle of Iwo Jima, where an initial foree of about 20,000 Japanese troops was 
eventually annihilated by the United States after about a month of eombat. Only 
the starting level is known for the Japanese forees, sinee more detailed Japanese 
reeords did not survive the battle, but the number of US troops involved is known 
for every day of the battle. Engel’s idea was to model the battle as one ruled by the 
square law, exeept for a eorreetion term required by the multiple landings of US 
troops. There are only two unknown parameters, so why not seareh for the pair 
that does the best job of matehing the known US foree levels? 

Figure 7 shows the result of Engel’s matehing. For the fit to US aetive troops, 
the two parameters were a = 0.0544 US easualties per Japanese man-day, and 
b = 0.0106 Japanese easualties per US man-day. The disparity between attrition 
rates might be explained by the faet that Japanese forees were fighting from well 
prepared defensive positions, but similar disparities have been found in battles 
where the roles are not so elearly defined (the Kursk battle eonsidered below, for 
example). In spite of this disparity, easualties on the two sides were about equal. 
Essentially all 20,000 Japanese troops were killed. The US also had about 20,000 
easualties, although only about a quarter of these died. Sinee there were about 
three times as many US troops as Japanese troops during the first week of the 
battle, and sinee the square of three exeeeds the attrition rate ratio of 5, 
Lanehester’s square law eorreetly prediets a US vietory. The fit shown in Figure 7 
is also remarkably good. Sheet “IwoJima” of Chapter5.xls replieates the Iwo Jima 
battle using Engel’s estimated eoeffieients (see Exereise 7). 
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Figure 7: Engel’s square law fit to the battle of Iwo Jima in World War II. 
The increasing portions correspond to US troop landings. There are actually 
two fits, depending on whether one counts US troops still alive or still active, 
with different attrition rates. In either case, it can be seen that Uanchester’s 
square law does a good job of tracking the number of US troops. 



Engel’s analysis was performed well after the battle, but, in prineiple, the 
same attrition eoeffieients that applied to Iwo Jima might have been supposed to 
hold for other similar battles in World War II. In faet, analyses of other World 
War II battles reveal eonsiderable eonsisteney between eoeffieients, even between 
theaters. See Willard (1962) for analyses of battles going all the way baek to the 
year 1618. 

Most of the work that has been done fitting equations to battles is not so flat- 
tering to the Lanehester theory as that of Engel. An example of this is the battle of 
Kursk. 

Example 7: The battle of Kursk was a tremendous 2-week battle in World War II 
that saw the German forees inside Russia switeh from offensive to defensive tae- 
ties at about the eighth day. Both sides kept track of both the number of personnel 
available and the number of casualties on each day, data which have been col- 
lected in CAA (1998). Table 1 shows part of the data. Reinforcements are not 
shown, so troop levels are sometimes surprising - there are more German troops 
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on day 9 than on day 8, for example. Nonetheless, if battles in the aggregate have 
the kind of predietability that Lanchester models attempt to exploit, we should be 
able to prediet daily losses using a simple formula based on troop levels. The 
square law that Engel had such success which involves only two parameters, and 
there is enough data available to estimate them. 



Time (days) 


German level 


German loss 


Soviet level 


Soviet loss 


0 


307,365 


800 


510,252 


130 


1 


301,341 


6,192 


507,698 


8,527 


2 


297,205 


4,302 


498,884 


9,423 


3 


293,960 


3,414 


489,175 


1,0431 


4 


306,659 


2,942 


481,947 


9,547 


5 


303,879 


2,953 


470,762 


11,836 


6 


302,014 


2,040 


460,808 


10,770 


7 


300,050 


2,475 


453,126 


7,754 


8 


298,710 


2,612 


433,813 


19,422 


9 


299,369 


2,051 


423,351 


10,522 


10 


297,395 


2,140 


415,254 


8,723 


11 


296,237 


1,322 


419,374 


4,076 


12 


296,426 


1,350 


416,666 


2,940 


13 


296,350 


949 


415,461 


1,217 


14 


295,750 


1,054 


413,298 


3,260 



Table 1: German and Soviet force levels and losses during the 14-day battle 
of Kursk . Day 0 starts at 1,800 on July 4, 1943. Losses listed for each day are 
during the 24 h preceding its start time. Force levels are as of the start time. 

But fitting a simple formula will not be easy. Note the high Soviet losses on 
day 8, for example. Considering only the force levels involved, which differ little 
from the force levels on day 7 or day 9, it would be hard to explain how such a 
thing could occur. The historical explanation is that day 8 began with the initiation 
of a climatic German armored attack meant to close some pincers on the Kursk sa- 
lient. The attack ultimately failed, but only after the determined Soviets had sacri- 
ficed the requisite forces to stop it. There were also tremendous losses of armor on 
both sides on that day. Part of the statistical problem is that massive attacks like 
the one on day 8 compress time in the sense that there is much more combat activ- 
ity on some days than on others. The surprisingly high German losses on day 1 are 
also remarkable, and the small losses on both sides prior to that are further evi- 
dence that some notion of battle “pace” needs to be acknowledged. 

There are also other modeling questions that need to be dealt with: 

• The losses listed in Table 1 are for the previous 24 h, so perhaps 
each day’s losses should be based on the force levels of the previous 
day. 
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• Supplies of armor and artillery are crucial in such battles, so per- 
haps we should expand the database and include separate attrition 
coefficients for each distinct force component. There would be an 
attrition coefficient, for example, for Soviet artillery against Ger- 
man armor. Section 5.6 will describe an application involving many 
components on each side, with a correspondingly large number of 
attrition coefficients. 

• Losses listed are “total” in the sense that they include noncombat 
losses. We would expect noncombat losses to be independent of the 
enemy force level, rather than being predicted by (say) the square 
law. Should we separate losses into combat and noncombat losses, 
with a separate forecasting formula for each? 

Sheet “Kursk” of Chapter5.xls defines one possibility for dealing with some 
of the issues raised above. That sheet includes the data of Table 1, together with 
some functions needed to fit it. The losses between time t-l and t (“losses on day 
t”) are assumed to be related to the average of the force levels on day t-l and day 
t. The adjustable parameters include a “pace” parameter for each day that meas- 
ures the pace of battle relative to day 1 . These parameters are multiplicative for 
the attrition on both sides on that day. There are 13 such pace parameters, plus two 
more for the attrition rates that define the square law. The idea is to adjust those 
1 5 parameters to minimize some measure of error when forecast losses are com- 
pared to what actually happened in the battle. 

On any given day, let the forecast losses be jil. According to the stochastic 
ideas of Section 5.3, the actual losses X should be a Poisson random variable with 
that mean. Since X is theoretically Poisson, its variance is also jil, so the quantity 
{X- jXfijj. should be a random variable with mean 1. We can therefore make 

our measure of misfit be a sum of 28 such quantities, one for each side and day, 
and then adjust the 15 parameters to minimize this measure of misfit. If our as- 
sumptions about attrition are all correct, it should (when minimized) be on the or- 
der of about 28. On sheet “Kursk” of Chapter5.xls, this is accomplished using Ex- 
cel’s Solver. 

The result of the minimization is that the two attrition parameters are 0.008 
German casualties per Soviet man-day, and 0.038 Soviet casualties per German 
man-day (note the rough correspondence of these attrition rates to Engel’s 
estimates for the battle of Iwo Jima). The minimized misfit is 5,581, much larger 
than the expected 28. The two worst error terms are due to the high German 
casualties on day 1 and the low German (compared to the Soviet) casualties on 
day 8, which together account for almost half of the misfit. Apparently the 
difference of a Poisson random variable from its mean is the least of our problems; 
the square law just does not fit very well, even after adjusting for the variable pace 
of battle. The reader is invited in Exercise 8 to find a better fit to the data in Table 
1, but be warned: the Kursk data have been examined in detail by others (Turkes, 
2000; Lucas and Turkes, 2004) without much success at finding a satisfactory fit. 
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Other battles of World War II have also been studied from a Lanchestrian 
perspective, e.g. Pricker (1998). The battle of Kursk is not alone in being hard to 
fit. A great deal of effort has been spent trying to fit simple attrition models to his- 
torical battles, much of it with similarly disappointing results. Part of the problem 
is that real battles are not designed well from a statistical standpoint. At Kursk, a 
significant problem is that German force levels remain nearly constant throughout 
the battle, and even Soviet force levels do not fluctuate very much. Battles to the 
finish (like Iwo Jima) are better, statistically speaking, but rare. Perhaps more sig- 
nificantly, historical battles are strongly influenced by subtleties that are hard to 
capture in a simple attrition formula and may have been understood only by the 
side initiating the action. Terrain is important in actuality, but missing from 
Lanchester’s formulation. Logistics and morale are also usually missing, but fre- 
quently crucial in actual battles. If battle were as simple as Lanchester would have 
it, a general staff would hardly be necessary. 

It does not follow from the frustrations outlined above that either Lanchester 
models or the statistical approach to estimating parameters is useless. Every study 
of Kursk has confirmed the strong lethality of the German troops involved, a ten- 
dency that was common to other battles involving the German army in World War II. 
The strong importance of numbers is confirmed by the battle of Iwo Jima. Given 
the multi-sided, surprise-laden nature of warfare, any combat model can be expected 
to err significantly when compared to reality. Models, however, approximate, are 
still a prerequisite for thought. 

By the way, in spite of their strong advantage in terms of lethality, the out- 
numbered Germans eventually lost the battle of Kursk. It turned out to be the last 
German offensive on the Eastern front. 



5.5 Aggregation and Valuation 



Combat models are sometimes described as “aggregated” or “high resolution,” but 
aggregation should really be measured on a continuous scale. Any combat model 
that counts things has made a judgment that two things are so similar that they can 
safely be thought of as identical. To the extent that dissimilar things are treated as 
if they were identical, the model is said to be more or less aggregated. 

The Lanchester models introduced earlier in this chapter have counted only 
one kind of entity on each side and are therefore aggregated. There may be no 
harm in this aggregation, even though a closer look at the real world would reveal 
significant disparities among the entities. Suppose, for example, that we initially 
have a square law model where the attrition rates are a and b, and the initial num- 
bers are Xo for blue and yo for red. Suppose further that the blue forces are discov- 
ered to consist of half shooters and half cheerleaders. The cheerleaders perform 
some function that blue considers valuable, but do not shoot at red. If we were to 
leave the cheerleaders out of the battle, and simply double b, halve a, and halve Vq 
to keep track only of the blue shooters, Lanchester’s equations would predict the 
same survivors for red at all times. Since blue forces are reduced by half, but the 
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lethality of the remainder is doubled, red attrition is unchanged (recall Equations 
5.1). Since red’s lethality coefficient is halved, so is attrition to blue, but then blue 
has only half the forces to attrit, so nothing essential about the battle changes. As 
long as we are careful about the meaning and measurement of lethality coeffi- 
cients, it is a matter of convenience whether we count blue cheerleaders or not. 
Aggregation is not dangerous in this circumstance. 

On the other hand, suppose red is able to distinguish blue cheerleaders from 
blue shooters and concentrates all of his fire on the blue shooters in the revised 
battle. This corresponds to doubling b, halving Xo, and not halving a (since all of 
red’s fire is concentrated on the blue shooters). The revised battle now differs 
from the original; blue could very well win the original, but lose the revision (see 
Exercise 9). In this case, aggregation would be seriously misleading. Evidently, an 
important consideration in aggregation should be whether the other side can be 
expected to distinguish between the things being aggregated. 

There is much more to be said about how much, if anything, is to be lost 
through aggregation (Davis, 1995), but sometimes there is no alternative. Engel 
(1954), for example, knew only the total count of Japanese troops initially on Iwo 
Jima. Constructing a model based on discriminating various types of Japanese sol- 
diers was simply not an option. 

In the authors’ experience, the only method of aggregation used in practice is 
the weighted sum. If Xy is the number of entities of type i and subtype j, then the 
equivalent number of aggregated type i entities is taken to be x, = ^ , where 

,/ 

Wij is the “weighf’ of subtype j within the ith type. One of the weights is usually 
selected to be unity, so that x, has the meaning “equivalent number of units of the 
selected subtype.” Sometimes all of the weights are taken to be unity, reflecting 
the judgment that all subtypes are equivalent as far as combat is concerned. The 
Kursk data (CAA, 1998) include counts of tanks for both sides. There were actu- 
ally a variety of tanks involved in that battle, so simply counting tanks is an exam- 
ple of using unity weights for all subtypes. The Kursk data include four fundamen- 
tal components of combat power: combat manpower, armored personnel carriers 
(APCs), tanks, and artillery (only the manpower data are shown in Table 1). The 
discussion in Section 5.4.2 effectively uses the weights (1, 0, 0, 0) for the four 
components, since data about everything except manpower are ignored in that sec- 
tion. To construct a better scalar measure of total combat power, more realistic 
weights might be used. Aggregated Kursk models sometimes use the weights (1, 
5, 20, 40) for the four components (Turkes, 2000), making the military judgment 
that having one additional tank is equivalent to having 20 additional men, etc. 
Whatever the source of the weights, the object in employing them is always to 
simplify through aggregation, even if some validity is lost in the process. 

A weighted sum of components is an aggregated measure of total combat 
power. Consider two blue entities, each of which kills aggregated red combat 
power by killing a variety of red entities. If the first blue entity kills red combat 
power twice as fast as the second, then a good case can be made that the weight 
for the first blue entity should be twice that of the second. More generally, we 
might adopt the principle that a set of weights for the two sides is consistent if 
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each entity’s weight is proportional to the rate at which it kills the other side’s 
weighted combat power. In a square law-type battle where all fire is “aimed,” the 
consistency principle leads to a set of linear equations that can be solved for a 
nearly unique set of weights (Howes and Thrall, 1973). The essential mathemati- 
cal problem in deriving these weights is that of finding the eigenvalue of a matrix, 
so the method is usually referred to as the eigenvalue method (there is also an 
ATCAL method, see CAA, 1983). The beauty of such methods is that they relieve 
the analyst of the need for making a priori judgments about military value, since 
the equations themselves determine proper weights. 

In spite of its advantages, the consistency principle must be used with care. 
The eigenvalue method applies only to square law battles, and its logical basis be- 
comes questionable if at any time some force component is reduced to zero. Also, 
the consistency principle would have the combat value of a truck being zero, since 
trucks do not actually shoot at anything. Logistic entities such as trucks are none- 
theless frequent targets of enemy fire in the real world, and with good reason. De- 
pending on purpose and circumstances, military judgment might provide a better 
set of weights. 



5.6 The FAst THeater Model (FATHM) 



The Department of Defense employs many models of combat in the defense plan- 
ning process, one of which is FATHM. Although FATHM is a minor model in 
terms of usage, it has the advantage of having a simple structure that employs 
many of the features described earlier, so it makes a good case study. This section 
provides a concise description. Further details can in found in Brown and 
Washburn (2007). 

FATHM is a deterministic model of large-scale air-ground combat. It is a 
simple model in the sense that it makes practically no reference to anything geo- 
graphic - there is no front line, nor any reference to the range at which a weapon 
system is effective, nor any representation of terrain. It nonetheless incorporates 
considerable detail. FATHM includes data about a variety of platform types (tanks 
of various kinds, artillery, etc.) among its inputs, and its purpose is to determine 
the significance of these inputs. Three things happen in sequence in each of 
FATHM ’s three-day time steps: 

• There is a battle between red and blue ground forces. This is the 
“ground” battle. 

• There is a battle between the blue air force and the red ground force (blue 
control of the air is assumed, so there is no red air force). This is the “air” 
battle. 
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• Reinforcements arrive and a phase-change determination is made (wars 
are presumed to have phases, and the phase affects the nature of the 
ground battle). 

Air and ground battles are intermixed in reality, but are imagined to be sequen- 
tial in FATHM for practical reasons. Specifically, FATHM attempts to capitalize 
on the pre-existence of the COmbat SAmple GEnerator (COSAGE), a ground 
model, and the Conventional Eorces Assessment Model (CFAM), an air model, in- 
tegrating the two into an air-ground model that runs quickly on a computer. Speed 
is important on account of the anticipated use where different blue orders of battle 
are compared. The idea that air and ground are intermixed in time would be better 
modeled by shortening the time step, but this would cause FATHM to slow down. 
The choice of 3 days is a compromise between validity and efficiency. 

The air and ground models each extensively employ EVA. In each battle in 
each time period, the number of surviving platforms is first calculated by a method 
that might be justified as being an accurate computation of the expected number of 
surviving platforms, but then this expected number is simply used as the actual 
number entering the next battle, even if it is not an integer. 



5.6.1 FATHM’s Ground Model 

COSAGE is a high-resolution Monte Carlo simulation of ground combat, gener- 
ally over a 2-day time interval, including all of the geographical features that are 
missing in FATHM. On account of its realism, it is far too slow to consider incor- 
porating directly. Instead, FATHM approximates COSAGE with a deterministic 
Lanchester model. As far as FATHM is concerned, COSAGE is the reality that the 
Lanchester model must imitate. 

COSAGE outputs include a “killer-victim” scoreboard that reports the total 
number of red platforms of each type killed by blue platforms of each type, and 
whether these kills are due to aimed (direct) or unaimed (indirect) fire. Indirect 
fire is usually artillery. Let 7^,, be the reported number of red platforms of type r 
killed indirectly by blue platforms of type b. Also suppose that the initial number 
of blue platforms of type b is , while the initial number of red platforms of 
type r is . If we equate COSAGE’s indirect fire to Lanchester’s linear law 
(equation (5.5)), blue’s losses should be proportional to the number of red plat- 
forms and also to the number of blue platforms. Therefore, if indiXi^^ is the indirect 
fire coefficient, assuming that the COSAGE battle is 2 days long, we should have 

indir, = . This formula calculates the attrition rate to be used in 

(2 days)5,„R,.„ 

the Lanchester model as a function of quantities shown on the COSAGE killer- 
victim scoreboard. The reason for the “approximately equaf ’ sign is that 2 days is 
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not an infinitesimally small amount of time, as the Lanehester theory demands. 
We will return to this problem shortly, but, for the moment, assume that all of the 
indireet fire eoeffieients for blue against red are given by this formula. The red 
against blue indireet eoeffieients are given by the same formula with b and r re- 
versed (the COSAGE output ineludes data about attrition to blue, as well as red). 

Sinee direet red losses are proportional to blue foree levels, but not to red 
foree levels, the eorresponding formula for the direet fire eoeffieient is 

dir. = , where D. is the number of red losses to direet blue fire ae- 

(2days)5,„ 



eording to the killer-vietim seoreboard. Again, b and r ean be interehanged to ob- 
tain the eoeffieients for direet red fire against blue. 

With all of the eoeffieients determined, we ean now make a Lanehester model 
of the battle. Letting and Rrt be the blue and red numbers of platforms of type r 

dR 

and b, respeetively, at time t, we have — — = '^{indirf^^Bi^^R^^ +dirf^^Bi ^^) , with the 

dt h 



usual proviso that the equation holds only if 7?,., > 0 . Together with the eorre- 
sponding equations for blue loss rates, these equations essentially eonstitute 
FATHM’s ground model. There is a problem, however; if we were to solve these 
equations over a 2-day time period with the same initial forees as used in 
COSAGE, we would not measure the same losses. The reason for this is that 
Lanehester’ s equations are solved with an infinitesimal time inerement, rather than 
a 2-day time inerement. FATFIM therefore “polishes” the raw attrition rates in an 
iterative seheme that ends when the Lanehester battle produees the same losses as 
the COSAGE battle. The polished rates seldom differ mueh from the raw ones. 
These polished rates are then employed throughout all time periods of FATFlM’s 
ground war, whieh may last for a 100 days or more. There is lots of arithmetie to 
do in the proeess, but ordinary differential equations are easy enough to solve nu- 
merieally that the “fasf’ part of FATFlM’s name is still justified. 

The COSAGE war is short enough that no platform type is eompletely wiped 
out, but FATFlM’s war is long enough that provision must be made for this possi- 
bility. Suppose that blue platform b is redueed to 0. No ehange in the indireet fire 
model is neeessary, but the direet eoeffieient now represents the rate at whieh 
eaeh red platform of type r aims lethally at a blue platform type that no longer ex- 
ists. The idea is absurd, so the direet fire eoeffieients are modified in FATFIM to 
proportionally reprogram this direet red fire to other blue target types. Similar 
modifieations are made when red platform types are wiped out. 

FATFIM permits the initial number of platforms to differ from the number 
used in the COSAGE battle. Suppose that all FATFIM inputs are double those of 
COSAGE. Direet attrition will also double, whieh is appropriate, but the indireet 
attrition will quadmple beeause it is proportional to both foree levels. This quad- 
rupling is aetually desirable if all forees are in the same region as in the COSAGE 
battle, sinee the density of targets per unit area will double along with the rate of 
fire, but it is also possible that FATFlM’s user intends to model a battle over a lar- 
ger region. For this reason, FATFlM’s user is required to input a parameter W that 
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represents the frontal width of the battle region. For comparison, the COSAGE 
output also reports the width Wc of the COSAGE battle. The indirect attrition rates 
are then adjusted by multiplying them all by the ratio WqIW. Thus, if the frontal 
width is doubled along with all of the platform numbers, indirect attrition will only 
double. 

In summary, FATFIM’s ground model is a Lanchester model that takes ad- 
vantage of the data available in the COSAGE killer-victim scoreboards, while still 
being able to replicate the results of the COSAGE battle. Adjustments to coeffi- 
cients are made to account for the different time and space scales of the typical 
FATHM battle. 

As mentioned earlier, ground battles have phases that depend on the defen- 
sive/offensive nature of the two sides. There is a separate COSAGE run for each 
phase, along with separate sets of implied attrition coefficients. When the right cri- 
teria have been met, FATHM changes phases by substituting the appropriate set of 
attrition coefficients. 



5.6.2 FATHM’s Air Model 

FATHM’s air battle amounts to an allocation of blue aircraft sorties to red targets 
(enemy platform types such as T-72 tanks). The targets shoot back only in the 
sense that the database recognizes aircraft attrition probabilities that depend on the 
target type. 

The allocation of aircraft to targets is optimized using linear programming 
(Appendix B), in spite of some relationships that are fundamentally nonlinear in 
the real world. For example, suppose a given target is attacked by n sorties, each 
of which has a kill probability of Pk- If the sorties are independent, then “power- 
ing up” would lead to a kill probability of l-(l-P^)” , a nonlinear function of n. 
FATHM instead uses the linear formula hPk, adding a constraint that the overall 
kill probability cannot exceed 1 . This is equivalent to using the upper bound dis- 
cussed in Section 3.4.2 to approximate the number of targets killed. Thus, if the 
kill probability is 2/3, at most 1.5 sorties can be assigned to the target, and the tar- 
get will surely be killed if that number of sorties is assigned to it. 

Forcing the mathematical program to be linear is dictated by FATHM’s ambi- 
tion to be fast. A linear program is the simplest type to solve, and even solving the 
linear program is a significant task because of the large variable count. FATHM’s 
linear program may have on the order of 80,000 variables! Why are there so 
many? Suppose that each of 100 target types could be attacked in one of 100 
ways. Then there would have to be 10,000 variables, since each doubly sub- 
scripted variable would stand for the number of targets of a certain type attacked 
in a certain way. FATHM’s allocation variables actually have six subscripts, so 
the large variable count should not be surprising. There are also a similarly large 
number of constraints. The primary constraint is that the number of sorties cannot 
exceed whatever amount can be generated in 3 days, but there are also constraints 
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associated with the subsidiary goals of killing enough red targets of certain speci- 
fied types to permit the phase of the war to advance, and more constraints in addi- 
tion. Solving even one linear program of this size is not trivial, and a separate lin- 
ear program is required for the air battle in each 3 -day time segment. Solution of a 
nonlinear program in those circumstances would be too time consuming. 

The object of the air battle is to kill as much red target value as possible, so 
red targets must be valued. FATHM’s method of doing this is a hybrid between 
military judgment and the consistency principle of Section 5.5. All platforms on 
both sides are first assigned a static value. One frequently used scale for this static 
value is replacement cost in dollars - military value should not differ much from 
replacement cost because the replacement process is itself budget constrained. In 
each 3 -day time period, each surviving red platform will kill a certain amount of 
blue static value. The dynamic value of that red platform is taken to be that 
amount of blue static value, but multiplied by a constant with units of time that 
represents the time horizon. The total value of the red platform is then the sum of 
its static and dynamic values. In this way the ground model communicates to the 
air model which red targets are particularly crucial to kill. The total value of a red 
truck is simply its static value, but the total value of a red tank will also include a 
term proportional to the rate at which that tank is currently killing blue static 
value. It is not necessary to find a similar total value for blue platforms; since 
FATHM is a one-sided model in terms of decision making. 



5.6.3 Implementation and Usage 

FATFIM is complied FORTRAN code that exploits the capabilities of ExceF"^ 
workbooks for input and output. The user adjusts the force levels, static target val- 
ues, etc., in a multi-sheet workbook, and then outputs comma-delimited fdes for 
later use by FATHM. FATHM reads these files, together with the killer-victim 
scoreboards from prior COSAGE runs, computes the Lanchester attrition coeffi- 
cients as described above, fights the multi-phase war, and then outputs the overall 
results about attrition to another comma-delimited file. This output file can then be 
input to ExceF'^ and displayed or manipulated as the user wishes. Figure 8 is a 
sample graphical output showing which blue platforms are causing the most attri- 
tion. Data can be displayed in other ways using Excel’s Pivot tables. 
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Figure 8: Attrition results of a FATHM battle. Each segmented vertical bar 
shows red platforms killed by each type of blue platform, with the segments 
corresponding to the code shown on the right (originally in color). Shown are 
21 army (usa) platforms, 4 air force (usaf) platforms, and 7 navy (usn) plat- 
forms. All results (including the platform counts) are fictitious. 
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Exercises 

(1) Derive an equation for the number of survivors on the winning side when 
Lanchester’s square law holds and battle continues until one side is wiped 
out. Hint: You will first have to employ a test to determine which side wins, 
and then solve a quadratic equation to determine the number of survivors. 
Check your answer using the “SquareLaw” sheet of Chapter5.xls. 

(2) Suppose that Lanchester’s linear law holds, with =100, a = 0.01, 

and b = 0.02. 

(a) Which side will win if combat continues until one side is wiped out, and 
how many survivors will there be? 

(b) Compute and plot as a function of time the number of survivors on each 
side, for the first 50 units of time. Use Euler’s method with a time increment 
ofO.5. 

Ans. In (a), blue wins with 50 survivors. Neither side is ever wiped out com- 
pletely when the linear law holds, so the two curves in (b) should gradually 
approach the horizontal axis, rather than dive through it. Make sure that your 
answer to (b) is consistent with (a). 

(3) Lanchester (1916), by way of emphasizing the importance of concentration 
of forces, considers a square law example where a = h = 1 and both sides ini- 
tially have 1000 units. If a single battle occurs, then both sides run out at the 
same time. What happens if blue divides his force in two, so that there is a 
first battle with 500 blue units and 1000 red units, followed by a second bat- 
tle with 500 blue units and the red survivors of the first battle? 

Ans. Red wins with 707 survivors. 

(4) Let a=h = 0.1,A=l, = 1000 , and consider the solution of the dif- 

ference equations (5.2). This is a special case of Hughes’ salvo equations 
where the two sides are exactly matched. If (5.2) is solved as given, the two 
sides run out at the same time. What happens if red strikes first in each cy- 
cle; that is, what happens if the update formula for y(t + A) replaces x(t) by 
x{t+A) ? How much of an advantage is this for red? Quantify the answer by 

using a spreadsheet to find the number of red survivors. To be precise, find 
the number of red survivors after the last cycle in which the number of blue 
survivors is still positive. 

Ans. 316 red survivors. 

(5) In Example 5, the “Endgame” sheet of Chapter5.xls was used to find the av- 
erage net difference of blue survivors (3.38). Use the same sheet to find the 
average number of blue survivors and the average number of red survivors. 
Is the difference between them 3.38? 
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Ans. 4.60 blue survivors and 1.22 red survivors, so the differenee is indeed 
3.38. 

(6) Example 6 refers to sheet “Tanks” of Chapter5.xls. The initial number of 
blue tanks in that battle is 20, but change that initial cell to reflect the idea 
that you can have more tanks if you are willing to adopt a cheaper command 
system that takes a longer time to shoot after a miss (larger T^). Specifically, 
let the initial number of tanks be 19+I„. For 0.5 <Xm<2, what is the best 
value of Tm? 

Ans. There are 7.7 blue survivors if = 0.5, and smaller numbers otherwise. 
In this case 19.5 high-quality tanks is better than a larger number of tanks 
that take longer to shoot. 

(7) Sheet ‘Two Jima” of Chapter5.xls replicates the battle of Iwo Jima using 
EngeFs estimated attrition rates. Experiment with the initial number of Japa- 
nese troops. What is the smallest initial number that would have left the US 
with zero active troops after 60 days of fighting? ExceFs Tools/Goal Seek 
function may be of use. 

Ans. 34,732. 

(8) Sheet “Kursk” of Chapter5.xls is set up to minimize the measure of misfit 
described in Example 7. See if you can find a better fit than the one de- 
scribed there, without using too many parameters, or a slightly worse fit that 
uses fewer parameters. You might, for example, try the linear law or even 
the logarithmic law (where attrition is proportional to own force level, not 
the enemy’s). What if all the pace parameters are forced to be 1? Does the fit 
suffer much? 

(9) Compare two square law battles between blue (v) and red (y). In the original 

battle, Vo= 200, 300, 0.04, and 0.01. In the revised battle, 

Vo = 100, yo = 300, a = 0.04, and b = 0.02. The idea in the revised battle is 
that red has figured out that blue’s 200 units include only 100 that actually 
shoot and concentrates all his fire on those shooters. Compare the two bat- 
tles. 

Ans. In a battle to the finish, blue would win the original battle with 132 sur- 
vivors. Red would win the revised battle. 

(10) The y side has 1000 men in each of several forts, with a = 0.02. The x side 
has 4100 men in a single group, with b = 0.01. The x side attacks the forts in 
succession, with the survivors of one battle entering the next. How many 
forts are taken before x is wiped out, assuming each battle proceeds until one 
side is exhausted? Assume first that the square law holds and next that the 
linear law holds. 

Ans. 8 for the square law, 2 for the linear law. 




5 Attrition Models 



The equations dx! dt = -k^ and dy ! dt = -k^ for x, y> 0 could be said to 
hold in “gladiator” combat where there are successive one-on-one combats. 
Compare the deterministic and probabilistic solutions when k\= k 2 = \ and 
K>To) = (3,2). 

Consider a Square law battle where {xQ,yQ,a,b) = (6, 5, 0.8, 0.9) , interpreted 
probabilistically. Compute the average net difference of survivors x-y ai the 
end of the battle. Use the “Endgame” sheet of Chapter5.xls to calculate the 
difference analytically, and also use the “SquareLawRnd” sheet together 
with SimSheet.xls to compute it by Monte Carlo simulation. The results will 
not be exactly the same, but should be close. 

In the Battle of Ein-A-Tirma (Lebanon War 1982), the theater of operations 
was southern Lebanon, which is a mountainous region crisscrossed by nar- 
row, steep, and winding roads. Mechanized units were typically forced to 
move in a single column. An Israeli tank battalion approached the village of 
Ein-A-Tinna from the south assuming that there were no Syrian troops in 
this village. As the first tank in the column turned around a horseshoe bend 
in the road, it was disabled by heavy fire from the village. The second tank 
in the column passed it and took over the position in the front, resuming the 
duel. It was also disabled, as was a third tank when it moved to the front. 
Later on the combat situation became more complex when another Israeli 
force approaching the village from the Northeast misinterpreted the situation 
and initiated a fire exchange with the Israeli forces in the south, causing 
some heavy fratricide casualties. As far as the first part of the battle is con- 
cerned, a small Syrian force (~6 tanks) managed to stop a battalion because 
it was able to exercise all of its fire power while the firepower of the Israeli 
forces was reduced, due to the topography, to a single tank. 

To make a Lanchester model of this situation, let m and n be the numbers of 
surviving Israeli and Syrian tanks, and assume that all Syrian tanks can fire 
at Israeli tanks, while only one (the lead) Israeli tank can return fire. Also as- 
sume that all tanks are equally effective, and that the battle continues until 
the loser is annihilated. Except for the restriction on Israeli fire, Lanchester’ s 
square law would apply. If there are initially two tanks on each side, what is 
the probability that Israel wins? Also, does it matter if some Syrian fire is di- 
rected at Israeli tanks that are not in the lead? 

Ans. Since the length of the battle is not in question, use the Markov formula 
(5.10) with Q(m,n) = n/(l + n) . If MOE(m,n) is the probability that Israel 
wins, then MOE{m, 0) = 1 for m > 0, while MOE(0, n) = 0 for n > 0. After 
applying (5.10) a few times, the answer is MOE(2, 2) = 0.36. Sheet “End- 
game” of Chapter5.xls can be used to do this if the correct transition prob- 
abilities are substituted for the function called “leftprob” on that sheet. It 
does not matter how the Syrian tanks direct their fire, as long as all fire is 
aimed at Israeli tanks. 
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(14) The analytic solution of equations (5.1) over the time period when both state 
variables are positive is 

if fa ) 1 f fa ) 

40 = - Xg-yoJyexp(t4ab) +- x„ + y exp(-t4ab) 

2[ Vh J 2[ ^Ib 

if fb ^ if lb ^ 

y(0 = - >’o-•^o^^-exp(t^/^) +- yg+Xg -exp(-t4ab) 

2^ Va ) 2[ ^la 



Here Xq andjo represent the given starting values v(0) and>’(0). Show by dif- 
ferentiation that (5.1) is indeed satisfied by these analytic equations. 




Chapter 6 

Game Theory and Wargames 



The game ’s afoot! 
Sherlock Holmes 



6.1 Introduction 



Military operations are eondueted in the presenee of uneertainty, mueh of whieh is 
due to unpredietable enemy aetions. The part due to the enemy is a different kind 
of uneertainty than not knowing, for example, whether a roeket will hit its target. 
In the ease of the roeket, we are dealing with a deviee that itself has no feelings or 
desires about the outeome. There are many sourees of that kind of randomness in 
both military and eivilian affairs, all of whieh we attribute here to ehoiees by 
“Mother Nature,” a fietional deeision maker who makes ehoiees unpredietably, 
but never with any regard for their eonsequenees. We often quantify Mother Na- 
ture’s aetions by providing a probability distribution over her ehoiees, as in all 
previous ehapters of this book. 

Uneertainty about enemy aetion deserves a different kind of treatment, sinee 
an enemy by definition has strong feelings about the eombat outeome. Deeision 
making against an enemy differs qualitatively from deeision making against 
Mother Nature. Ships traveling through submarine-infested waters sometimes 
adopt a randomized zigzag eourse, the idea being to make it diffieult for a subma- 
rine to aim a torpedo aeeurately. Ships traveling through waters infested with iee- 
bergs are also at risk, but they do not zigzag, nor do they attempt to behave ran- 
domly in any manner. Both situations involve uneertainty, but there is something 
fundamentally different about the aetions that are ealled for when dealing with a 
sentient enemy. 

Analysts have followed two separate lines of thought in trying to deal with 
multi-sided deeision-making situations. One line of thought is theoretieal and 
leads to game theory (Seetion 6.2). The other is experimental and leads to 
wargaming (Seetion 6.3). Both lines of thought will be seen to have advantages 
and disadvantages. The disadvantages are signifieant, and partially explain why 
most eombat models are aetually single sided. The advantages should not be lost 
sight of 

One advantage of the multi-sided approaeh has to do with the use of eombat 
models to evaluate the effeets of new taeties or equipment. Single-sided eombat 
models are foreed to rely on more or less sophistieated deeision rules for all par- 
tieipants other than the innovating side. These rules typieally attempt to imitate 
past behavior of opponents. If the new taetie or equipment is a signifieant depar- 
ture from the past, the effeetiveness of the innovation may be exaggerated in a 
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single-sided model through lack of sufficient adaptability in these rules. The mod- 
em machine gun would have been devastating against the kind of massed infantry 
charges that occurred in the American Civil War. The ultimate effect of the ma- 
chine gun was to change the nature of infantry tactics to mitigate its effects, but it 
would have been hard to anticipate this change in the rales of a civil war era sin- 
gle-sided model. Multi-sided models have a better chance of anticipating the ap- 
propriate tactical reaction to such innovations, either through the inherent clever- 
ness of humans (in wargaming) or through optimization (in game theory). 



6.2 Game Theory 



Do not be misled by the name - the theory is not aimed merely at parlor games 
such as checkers or rock-paper-scissors, although we will often use such games 
as examples. The whole topic was invented at the end of World War II by von 
Neumann and Morgenstem (1944), who make it plain in their seminal book that 
their ambition is to apply game theory to military and economic situations, as well 
as parlor games. The central feature of all these situations is that the game’s out- 
come is determined by the actions of multiple decision makers, and that they do 
not all desire the same outcome. When formalized by specifying each player’s 
utility as a function of all of the players’ strategic choices, we have what von 
Neumann and Morgenstem call a game. 

Two-person zero-sum (TPZS) games are a special class of game with exactly 
two players who are completely opposed, as in most parlor games. Chess, back- 
gammon, tic-tac-toe, and rock-paper-scissors are examples. So is baseball - even 
though each side or “player” in baseball is composed of several humans, the im- 
portant thing is that all of the humans on each side are in complete agreement as to 
goals. Monopoly (when played by more than two players) is not TPZS. 

The rest of this chapter is devoted entirely to TPZS games. There are two rea- 
sons for this restriction: 

• combat usually involves two opposed sides 

• the TPZS theory is in better shape than the theory of games in 
general 

With regard to the second reason, the curious reader should search the Internet for 
“Prisoner’s dilemma” as an example of a fascinating, two-person nonzero-sum 
game that has been and remains the object of much analysis. In spite of its sim- 
plicity and large amounts of pertinent thought, there is still no commonly accepted 
“optimal” way to play the game. We do not mean to deny the importance of gen- 
eral games, which are being used increasingly to model economic situations such 
as auctions, and which do potentially have military applications. However, the 
TPZS part of game theory will suffice for our purposes here. 
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6.2.1 Matrix Games and Decision Theory 

The rules of any game must speeify the possible aetions or “strategies” available 
to eaeh side, and, as a funetion of these aetions, the payoff to eaeh player. In the 
ease of TPZS games a single payoff funetion suffiees, sinee the players are eom- 
pletely opposed. It is eonventional to define player 1 as the maximizing player and 
player 2 as the minimizing player, in whieh ease the payoff funetion is essentially 
player I’s utility for the games’s outeome. If eaeh player has only a finite number 
of strategies, the rules of the game ean be shown by giving a single matrix that 
represents player I’s payoff, with one row for eaeh of player I’s strategies, and 
one eolumn for eaeh of player 2’s strategies. The entry in the /th row and yth eol- 
umn is usually ealled ay. This is the payoff that player 1 wants to maximize, while 
at the same time player 2 wants to minimize it. 

It happens that the same matrix notation ean also be used to represent deeision 
problems where player 2 is replaeed by Mother Nature. Before eonsidering sueh 
matriees as games, let us review the eonventional deeision theoretie solution. 
Mother Nature ehooses the eolumn, and the deeision maker ehooses the row with- 
out knowing her ehoiee. To quantify Mother Nature’s aetions, we introduee a 
probability distribution y for whieh yj is the probability that Mother Nature 
ehooses eolumn j. The average payoff when row i is ehosen by the deeision maker 

n 

is then avg. = ^ , where n is the number of possible ehoiees by Mother Na- 

j=i 

ture. Player 1 should seleet i to maximize avg„ his expeeted utility. 

Example 1: In World War II, there was a time when aireraft took off earrying 
depth eharges that they hoped to use in sinking submarines. Eaeh depth eharge 
eould be set before takeoff to explode at either a shallow or deep depth. The sub- 
marine target would also be either shallow or deep, depending on how mueh warn- 
ing the submarine got of the approaehing attaek. We wish to model this situation 
as a deeision problem where the aireraft deeides how to set the depth eharge. In 
setting the depth eharge, the aireraft does not know the depth of the submarine that 
it will be used against, but it does know the payoffs shown in Table 1, whieh dis- 
plays the eonditional probability of sinking the submarine as a funetion of the 
depth eharge setting (sh or dp) and the submarine depth (SH or DP). 

In Table 1, the best setting against SH is sh and the best setting against DP is 
dp, but the aireraft does not know whether SH or DP will be the ease when setting 
the depth eharge. The eonventional solution to this dilemma is to introduee a dis- 
tribution y on SH or DP, whieh we take to be y = (0.4, 0.6). Thus, while most 
submarines have managed to be DP by the time the aireraft aetually drops the 
depth eharge, 40% are still shallow (SH) at the time of the attaek. In World War II, 
that partieular distribution was based on operational experienee. Sinee 
avg^h = 0.4x0.54-0.6x0.1 = 0.26 and avgdp = 0.17, the best depth ehange setting 
is sh. The two avg numbers represent uneonditional sink probabilities aeeording to 
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the theorem of total probability, so the aircraft should expect success in 26% of its 
attacks if it sets the depth changes to sh. Three things are true in this example: 

1 . most submarines are DP when encountered, 

2. the best depth charge setting for DP submarines is dp, and 

3. the best depth charge setting is sh. 

This example is actually a true story, except for the assumed precision of the 
numbers. It is often told to illustrate the advantages of a quantitative approach 
based on decision theory (Morse and Kimball, 1950, their Section 3.4.8). In the 
press of combat, it is easy to make the first two observations, and come to the 
“logical” conclusion that the best setting is dp. A more careful analysis leads to 
the opposite conclusion and a significant increase in effectiveness. 



Shallow depth (sh) 
Deep depth (dp) 



Shallow submarine (SH) 


Deep submarine (DP) 


0.5 


0.1 


0.125 


0.2 



Table 1: Conditional probability of sinking tbe snbmarine in fonr cases. 

The assumption that y = (0.4, 0.6) is a crucial part of the analysis in Example 1. 
German submarines of the era routinely reacted to the sighting of an aircraft by 
submerging, but were occasionally surprised on the surface and ended up at a shal- 
low depth when attacked. Thus the SH fraction is essentially “fraction of the time 
surprised,” over which the submarines had no control. Suppose we take away that 
difficulty, which was peculiar to the equipment of the era, give the submarine 
commander a free choice as to depth, and ask again what the depth charge setting 
should be. We can no longer assume y = (0.4, 0.6), but it is still necessary to make 
some kind of assumption about the submarine’s choice. 

One such assumption is the “conservative” assumption of worst-case analysis. 
This amounts to assuming that the submarine will be DP if the setting is sh, or SH 
if the setting is dp. Our conclusion would then be that the best setting is dp, and 
that the submarine will be sunk 0.125 of the time (the sh setting would lead to 
only a 0.1 sink probability). Regardless of tactics, the submarine will be sunk at 
least 0.125 of the time if depth charges are consistently set to dp. 

Worst-case analysis makes two assumptions about player 2’s behavior. One is 
that he is opposed to player 1 . The other assumption is omniscience, since player 2 
is in effect assumed to select a column in full knowledge of player I’s row choice. 
The game theory idea is to retain the first assumption, but not the second. Cer- 
tainly the German submarines of World War II were not omniscient - how could 
the submarine commander know the depth charge setting? Just as the aircraft does 
not know the submarine depth, neither does the submarine know the depth charge 
setting. The two players are treated symmetrically in game theory, with neither 
knowing the choice of the other. 

The trouble is that the assumption that each player acts in ignorance of the 
other’s choice seems at first to go nowhere. The aircraft, thinking that the subma- 
rine prefers the DP setting, might set the depth charge to dp. But then the subma- 
rine, thinking that the aircraft might think that the submarine will choose DP, 
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might actually choose SH. This kind of “if he thinks that I think that he thinks...” 
regression clearly has no end, and cannot be the basis for a theory of action. The 
TPZS theory is not based on it. 

The solution for both sides turns out to be to simply act randomly, albeit with 
a carefully selected kind of randomness, von Neumann (1928) shows that all finite 
TPZS games have a satisfactory notion of solution if such randomized or “mixed” 
strategies are employed. Here is what we mean by “satisfactory”: 

Definition: Given the payoff matrix {ay), an nrxn TPZS game has a solution if 
there exist probability distributions x and y and a number v such that 

m n 

> V for j = and < v for ; = l,...,m. If all conditions are satis- 

1=1 /=i 

fied, the probability distributions are called “optimal mixed strategies,” the num- 
ber V is called the “value of the game,” and (x,y,v) is the solution of the game. 

The first inequality condition in the definition states that, if player 1 selects a 
row according to x, the average payoff will be at least v, no matter what player 2 
does, even if player 2 knows x. The second condition guarantees that the payoff 
will never exceed v if player 2 acts according to y, even if player 1 knows y. 
Taken together, the two conditions establish that v can be guaranteed by each side, 
regardless of the actions of the other, and therefore deserves to be called the 
game’s value. Do not be put off by the word “average” here. There is no implica- 
tion that the game must be played repeatedly. Even if you only gamble once, you 
might as well take the gamble with the largest probability of winning. 

Our depth charge game has a solution, and its value is 7/38 (approximately 
0.18). Player 2 can guarantee that the sink probability will not exceed that number 
by using y = (4/19, 15/19). To confirm that, use that distribution for Mother Na- 
ture’s choice in a conventional decision theory problem (Exercise 1). You should 
find that player 1 has two choices, both of which lead to a sink probability of 7/38. 
In fact, the game theory solution can be thought of as worst-case analysis where a 
“case” is a probability distribution for Mother Nature - there is no worse distribu- 
tion than y. That same game value can be guaranteed by the aircraft using 
x=(3/19, 16/19). If each depth charge is set to be sh or dp according to that distri- 
bution, there is nothing the submarine can do to prevent being sunk 7/38 of the 
time. Even if x is implemented by including a random number generator in the 
depth charge itself, so that no action by the pilot is necessary, and even if one of 
those depth charges is captured and exploited by the submarine force, there is still 
no way to prevent being sunk 7/38 of the time. Neither party needs to keep his op- 
timal mixed strategy secret, although the choice from that distribution on any par- 
ticular play of the game should of course not be revealed. Both mixed strategies 
have the property of equalizing the average payoff as seen by the other player, and 
each can be derived by writing two linear equations to that effect (Exercise 2). 

All finite TPZS games have a solution, as von Neumann (1928) has shown. 
His proof is not constructive, but by now there are efficient ways of computing 
solutions. One way is to solve an optimization problem where one of the players 
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optimizes the variable v, subjeet to appropriate eonstraints from the definition 
above. For example, the problem for player 2 is to minimize v, subjeet to the eon- 
straints that y must be a probability distribution (sums to 1, no negative numbers), 

and ^ V for all i. Sheet “GameSolve” of Chapter6.xls has an implementa- 

j=i 

tion where the user inputs an arbitrary game matrix and Solver finds the solution, 
ineluding the optimal y. The optimal x eould be determined from a similar optimi- 
zation for player 1 and is guaranteed to produee the same game value. Flowever, 
player 1 ’s mixed strategy turns out to be the dual variables in player 2’s minimiza- 
tion, so a single minimization does it all. Sheet “GameSolve” exploits this, report- 
ing both players’ optimal strategies after a single minimization. 

von Neumann’s existenee theorem stipulates that the number of strategies 
must be finite. Some games with infinitely many possible strategies also have so- 
lutions, but some do not. An example of the latter is “piek the largest number.” 
Two players eaeh seeretly write down a number on a pieee of paper. The numbers 
are then eompared, and player 1 wins if and only if his number is bigger than 
player 2’s. The rules are perfeetly well defined, but there is elearly no optimal way 
to play that game, with or without a random number generator. 

This seetion has so far summarized about a eentury’s thought on how to deal 
with eompetitive deeision-making problems. The results were by no means obvi- 
ous before 1928 - von Neumann’s theorem was onee thought to be false by some 
powerful mathematieians. Nonetheless, the faet is that all finite TPZS games are 
now known to have a solution if mixed strategies are permitted. You may find this 
triumph unimpressive. Most people are not attraeted to the idea of making deei- 
sions by randomization, whatever the seeming logieal neeessity. It is the authors’ 
experienee that military offieers, in partieular, are repelled by the idea of respond- 
ing to questions like “Captain, should we turn right or left?” with “Flip a eoin!” 
The idea of basing aetion on a guess at the opponent’s ehoiee is more satisfying, 
espeeially, of eourse, if the guess turns out to be eorreet. While von Neumann has 
shown that it is not neeessary and possibly unwise to go through the proeess of 
trying to outguess your opponent, the faet is that most people would like to try it. 
Evidenee of this is the existenee of the World RPS Soeiety, whieh stages tourna- 
ments where experts at roek-paper-seissors eompete against one another. RPS is 
easily shown to be a TPZS game where the optimal strategy for either side is to se- 
leet eaeh of the options with probability 1/3 (Exereise 3), so any neophyte with a 
random number generator ean avoid losing to the world’s greatest RPS player by 
simply ehoosing strategies at random. The soeiety’s members surely know that, 
but still enjoy playing the game. 

In spite of our human predileetion for attempting to outguess the opponent, 
von Neumann’s theorem still stands. If nothing else, at least take note that the 
game value v is a way of summarizing a eomplieated situation with a single 
equivalent number. In a eombat model, we might say that every meeting between 
an aireraft and a submarine will sink the submarine with probability 7/38, never 
mind the details, as long as both players are rational. Sueh simplifieations are of- 
ten valuable. 
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Example 2: Colonel Blotto has three regiments available for attacking two forts, 
and must decide how many regiments to use on each fort. His opponent has four 
regiments available for defense, which he must also divide between the two forts. 
Each party knows the overall strength of the other, but neither party knows how 
the other’s forces will be divided. Blotto takes a fort if and only if his assignment 
is strictly greater than the defender’s, and his goal is to take as many forts as pos- 
sible. How should he divide his forces? 

We first note that worst-case analysis will result in Blotto taking no forts at 
all, since he is outnumbered. Blotto’s hope lies in the defender’s not being able to 
predict how Blotto will divide his forces. 

To solve this problem as a game, we first identify all of the strategies that are 
feasible for Blotto and his opponent. Blotto has four strategies and his opponent 
has five. Figure 1 shows the payoff matrix, using the notation that bold uv is the 
strategy of using u units on the first fort and v on the second (this usage is an ex- 
ception to our usual policy that bold symbols stand for vectors or matrices). 

After providing the “Gamesolve” sheet with a 4 x 5 game matrix (the row 
and column names are not inputs) and solving the game, we find that a solution is 
X = (0.5, 0, 0.5, 0), y = (0, 0.5, 0, 0.5, 0), and v=0.5. Blotto can guarantee to win 
one fort half the time by flipping a coin to decide between 30 and 12, but cannot 
guarantee anything more than that. An alternative optimal mixed strategy for 
Blotto is to flip a coin between 30 and 03. The defender flips a coin to choose 31 
or 13. 





40 


31 


22 


13 


04 


30 


0 


0 


1 


1 


1 


21 


1 


0 


0 


1 


1 


12 


1 


1 


0 


0 


1 


03 


1 


1 


1 


0 


0 



Figure 1: Showing the number of forts won by Blotto as a function of bis own 
division of forces and bis enemy’s. Strategy names are in bold. 

The power of modem computers is such that games with thousands of strate- 
gies can be routinely solved. The depth charge game, for example, would be solv- 
able even if both sides had thousands of different choices for depth, rather than 
only two. 

In spite of the efficiency of modern linear programming codes, however, 
many games are simply too big to solve in the matrix form described above. If 
Blotto had 10 regiments to attack 10 forts, the payoff matrix would have 92,378 
rows, rather than 4, and the number of rows would be still larger if the 10 regi- 
ments were not identical. As a practical matter, even listing all of Blotto’s possible 
strategies would be a daunting task. Although any finite game can be solved in 
principle by first constmcting its payoff matrix, it will sometimes be advisable to 
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represent the game in a different manner. The following seetions deal with some 
of the possibilities. 



6.2.2 Tree Games and Saddle Points 

Many games involve a sequenee of moves, with eaeh move being a ehoiee for 
player 1, player 2, or Mother Nature (ehanee moves). The rules of sueh games 
may be better expressed in the form of a tree, rather than a matrix. Figure 2 
shows an example. Play begins at the root of the tree, where player 1 must 
ehoose between three branehes, two of whieh lead to a ehoiee for player 2 , and 
one for Mother Nature. Payoffs are shown at the terminal “leaves” of the tree. 
Player 1 maximizes, player 2 minimizes, and Mother Nature randomizes aeeord- 
ing to probabilities shown on the branehes for eaeh of her deeisions (solid dots 
in Figure 2). 




Figure 2: A simple tree game. Circles are moves for the player whose number 
appears within the circle. Solid dots are randomizations, with probabilities 
given on the branches. Play begins at the root and ends at a leaf where fur- 
ther branching is impossible. The number given at each leaf is the payoff. The 
G„ symbols are for discussion only and are not part of the rules: G„ refers to 
the subtree that follows the symbol and G 4 is the root. 

Although play begins at the root, the best way to find optimal strategies is by 
beginning eomputations at the end and doing a “baekward rollup.” Thus, at point 
Gi, player 1 will ehoose 3 rather than 1, so point Gi ean be thought of as a leaf 
where the payoff is 3. The preeeding move for Mother Nature is therefore worth 
the average of 3 and 0, whieh is 1.5. Sinee player 2 prefers 1 to 1.5, point G 2 is 
equivalent to a leaf whose value is 1. The value of G 3 is 0, so the value of the 
game G 4 is the larger of 1, 6 , and 0, whieh is 6 . This logie is automated on sheet 
“Tree” of Chapter6.xls. When the game is played optimally by both sides, player 1 
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selects the middle branch at G 4 , Mother Nature chooses either 10 or 5 according to 
the given probabilities, and the game ends. In effect, G„ is a subgame whose pay- 
offs may include other games. 

The tree game shown in Figure 2 is boring when played (player 2 never even 
gets to choose anything), but still exhibits all of the features of the type. Tree 
games include more interesting games such as tic-tac-toe and chess (which have 
no chance moves), backgammon (which does), and most military board games 
that involve the rolling of dice and a playing board that is at all times visible to 
both sides. Chess, for example, has an initial move for White with 20 branches (16 
pawn moves and 4 knight moves), and eventually payoffs of 1 (White wins), -1 
(Black wins), or 0 (draw) at the leaves. Furthermore, all such games can be solved 
in principle by the backward substitution technique. The values of tic-tac-toe and 
checkers are both known to be 0. The value of chess is unknown but is surely ei- 
ther 1, 0, or -1. The value of backgammon is 0, by symmetry, but the optimal 
strategies for playing the game are unknown. It is only in the case of tic-tac-toe 
where the method of explicitly drawing the tree and using backward substitution 
might actually be practical - the tree in the other cases is far too large to deal with. 
Nonetheless, representing the game as a tree is still a useful conceptual device. 

Tree games can also be represented as matrix games. One of player I’s strate- 
gies for the game of Figure 2, for example, is “first move to G 2 , and then take the 
1 if we ever get to G4.” That strategy is not very smart because of its behavior at 
G4, but all we require of any strategy is that it be a complete rule for choice no 
matter what the moves of the other player. Our initial goal is to simply list them 
all, rather than to seek strategies that are optimal. Player 1 has three other distinct 
strategies, and player 2 also happens to have four strategies. One of player 2’s is 
“take the 1 at G2 or take the 5 at G3.” If player I’s named strategy is matched with 
player 2’s, the payoff is 1. With sufficient patience, we could name all four strate- 
gies for each side and fill out the entire 4x4 payoff matrix for this game. If we do 
that, we will find a matrix element that is simultaneously the largest entry in its 
column and the smallest entry in its row, i.e., a saddle point of the matrix. For the 
game in Figure 2, the row involved will be “at G4, choose to hand it over to 
Mother Nature,” and the value of the saddle point will be 6 . 

If a payoff matrix has a saddle point at {ij), then player 1 can guarantee ay by 
choosing row i, regardless of what player 2 does, and player 2 can guarantee ayhy 
choosing row j, regardless of what player 1 does. It is not necessary to employ 
mixed strategies in such games, since the game value can be guaranteed by each 
side without resort to randomization. Tree games are sometimes called strictly de- 
termined for this reason. There is thus no computational point in representing a 
tree game as a matrix game; nothing new is discovered, and backward substitution 
is a more efficient solution technique anyway. The point here is that tree games 
are simply a special kind of game for which we have a more efficient solution 
method than linear programming. 

Tree games are also sometimes called games of perfect information, the es- 
sential point being that, whenever it comes time for a player to make a choice in a 
tree game, he knows exactly where he is in the tree. An adequate theory for finite 
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tree games (Zermelo, 1912) existed well before 1928, when von Neumann proved 
his theorem for all finite TPZS games. 

While many parlor games are games of perfect information, there are also 
many that are not, and which therefore cannot be represented as trees. These in- 
clude most card games (cannot see the other player’s hand), blind chess (cannot 
see the other player’s pieces), battleship (cannot see where the other player has lo- 
cated his ships), and rock-paper-scissors (cannot see what the other player’s sym- 
bol is, at least not in time to affect one’s own choice). An attempt to represent 
rock-paper-scissors as a tree game would have to put either player 1 or player 2 at 
the root, and that player would lose. 



6.2.3 Solvable Games 



We have presented two methods for solving games, linear programming for the 
matrix games of Section 6.2.1, and back-substitution for the tree games of Section 
6.2.2. However, there are lots of interesting, large TPZS games that are ill-suited 
to either format. Although there have been many solutions of large TPZS games, 
most involve the exploitation of some special feature that makes it unnecessary to 
begin by listing all possible strategies, as in matrix games, or by completely laying 
out a game tree. Without being specific about exactly how this exploitation occurs, 
the object of this section is to describe some of the classes of large TPZS games 
for which solutions are computable as a practical matter. 

Guessing Games. One of the most common direct sources of matrix games is 
where player 2 chooses a number and player 1 is rewarded in some way for guess- 
ing approximately the same thing. The number might be the depth of a submarine, 
as in the depth charge example given above. It could also be a location at which to 
hide, a frequency on which to communicate, a time at which to act, or a road on 
which to travel. The payoff is usually highest when player 1 is nearly correct. The 
value of the game is sometimes computed and used for subsequent investigations. 
For example, an equivalent detection range might be computed as the value of a 
game in which a submarine chooses its own depth and a surface ship chooses the 
depth of a dipping sonar (see Exercise 5). Section 8.5.2 is an example of a kind of 
guessing game that arises in mine warfare. Washburn and Wood (1995) describe a 
guessing game played on a road network. 

Example 3: Player 2 can hide in one of n cells and player 1 has one chance to 
find him. If player 1 searches the wrong cell, the detection probability is 0. If 
player 1 searches cell i and the target is indeed there, the detection probability is 
the given number />,. For « = 3 and p = (0.25, 0.5, 1), the resulting payoff matrix is 
shown below. 
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The solution of this game is x = y = (4/7, 2/7, 1/7) and v = 1/7. In general, the solution 
of such games can be derived from the principle that x, and y, should be inversely 
proportional to /»,. The x part of the solution is in some ways surprising. It might be 
argued that player 1 should favor cells where />, is large, since he is best at searching 
such cells. Exactly the opposite is tme; a better argument is that player 1 should favor 
cells where p-, is small because that is where player 2 will probably hide. 
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Blotto games. An elementary example is given in Section 6.2.2. In general, a 
Blotto game involves a competition in n areas and a total resource for each of the 
two players, x for player 1 and y for player 2. Each player secretly partitions his 
resource over the areas, with (x„y,) being the allocations of players 1 and 2, re- 
spectively, to area i. The payoff to player 1 in area i is some given function 
yi) of the two allocations, and the total payoff is just the sum of the individ- 
ual payoffs. In most applications the payoff at target i is the probability that the ith 
target is killed, with the allocations of the two players being attackers and defend- 
ers, respectively. The meaning of the sum is then “average number of targets 
killed.” The rapid proliferation of strategies with n, x, and y makes a direct ap- 
proach unattractive, but efficient solution methods are nonetheless known. The gen- 
eral idea is to constmct n small solutions, one for each target, rather than a single large 
solution (Beale and Heselden, 1962 , Washburn, 2003b, Chapter 6). 

Most interest in Blotto games comes from the Cold War problem of defend- 
ing a group of targets from attack by ICBMs, in which case the tactic of secretly 
assigning anti-ballistic missile (ABM) defense levels before anything is observed 
about the attack is known as a “pre-allocated defense.” The defense might hope 
for better, since the defense has the advantage of acting later in time. If the entire 
vector X could be observed before dividing up y, the defense could do things like 
abandoning heavily attacked targets in order to save targets less heavily attacked. 
But there are practical difficulties with this idea. Although ICBMs can be tracked 
easily enough, the real threats are the re-entry vehicles (RVs) that they carry. RVs 
can be maneuverable and are released only a few minutes from impact, perhaps 
with decoys that will disguise the RVs until very late in the game. Besides, if all 
ICBMs are not launched at the same time, the known part of the attack may not to- 
tal X, so there will be the question of how many ABMs to save for the future. Con- 
siderations such as these lead to a pre-allocated defense where a certain number of 
ABMs are set aside to defend target i, regardless of what is observed about the at- 
tack or when the attack occurs, as in a Blotto game. In order to simplify the last- 
minute control problem, the defense essentially abandons whatever advantage 
might accrue from acting after the offense. A pre-allocated defense still keeps its 
allocations secret, thus forcing the attacker to guess at how big an attack is actu- 
ally necessary for any particular target. This makes it more effective than the 
Prim-Read defense (Section 4.2.4), which also abandons the advantages of 
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secrecy. Eckler and Burr (1972) include extensive coverage of such missile as- 
signment problems. 

The important thing here is that large Blotto games, whether motivated by 
missile allocation or not, are solvable as a practical matter. 

Markov games. A Markov game is actually a class of subgames where a pair of 
strategy choices, in addition to causing an immediate payoff to player 1, leads to a 
new subgame within the class. Tree games are Markov games, but the Markov 
class is more general in that the subgames need not be games of perfect informa- 
tion, and might repeat. American football is a good example: each play (each sub- 
game) leads to another play until the game is finally over, with some plays leading 
to scores and others not. Although each subgame is small enough to solve as a ma- 
trix game, the game as a whole is much too large to permit solution by that 
method. The computational idea is to work with the subgames, rather than the 
game as a whole, substituting game values whenever a subgame is called for. The 
alert reader will note that there is a potential difficulty in deciding where to start, 
since the value of the subgame one is trying to solve might depend on the values 
of other subgames that have not been solved yet. Indeed, it is easy to exhibit 
Markov games that have fatal circularities of this sort, but there is usually no theo- 
retical difficulty as long as some condition (the clock, in American football) forces 
the game to eventually end. 

The main military application of Markov games has been to combat between 
air forces. Modem fighter bombers are sufficiently flexible that they can be as- 
signed many roles, but they can only do one thing at a time, and must take off with 
weapons appropriate to the mission. There were some very tense naval moments 
in World War II when fleet commanders had to decide whether to launch defen- 
sive fighter aircraft or offensive bombers, with the ultimate payoff depending very 
much on similar decisions made by the enemy. When two air forces are engaged 
in extended combat, one can easily imagine subgames on any given day where the 
two sides try to guess what the other is doing, with a bad guess leading to an unfa- 
vorable subgame on the following day. Berko vitz and Dresher (1959) analyze a 
situation where each side on each day assigns a single type of aircraft to one of 
three roles: ground support, counter-air, or air defense. The US Air Force’s TAG 
CONTENDER model is more ambitious, including more missions and aircraft of 
multiple types. Bracken et al. (1975) review solution methods and propose some 
approximation procedures. 

Continuous Games. Here we assume that the payoff is a continuous function 
A(x,y), of the two players’ strategies. Under suitable conditions (Washburn, 
2003b) optimal mixed strategies still exist, as well as a game value. Such games 
clearly cannot be represented as matrix games, but solutions are nonetheless sur- 
prisingly common. 

Example 4. Suppose that x and y are each confined to the unit interval, and that 
player 1 wins if and only if \ x- y\< 0.2 . The value of this game is 1/3. Player 2 
can enforce this value by choosing y in the set {0, 0.5, 1} equally likely. There is 
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no choice of x for player 1 that covers two of these points, so the game’s value 
cannot exceed 1/3. An optimal strategy for player 1 is to choose x in the set {0.2, 
0.5, 0.8} equally likely. Three subintervals of length 0.4, each one centered on one 
of those points, completely cover the unit interval. Any point y in the unit interval 
must therefore be within 0.2 of one of those numbers, so there is no way for player 
2 to avoid all three of them in choosing y, even if he knows the set. Therefore 
player 1 can guarantee that the payoff will be at least 1/3, so the value of the game 
must be 1/3. Note that the value is not 0.4, the fraction of the unit interval that 
player 1 can cover. This is the influence of end effects. Note also that neither 
player utilizes a continuum of choices, even though such choices are feasible, and 
that the optimal strategies are not unique. 

The “method” used in solving Example 4 was to simply announce the solu- 
tion and then demonstrate its optimality. Ruckle (1983) defines a class of games 
called “geometric games,” and solves many of them by this technique. There are 
also continuous games where the players utilize a full continuum of choices, nota- 
bly duels where each player must choose when to begin firing. There is little to 
emulate here, since the secret is to be clever enough to see the solution in the first 
place, but there is one subclass of continuous games where a more systematic ap- 
proach is possible. 

If A{x,y) is a concave function of x for each y, then it can be shown that any 
mixed strategy for player 1 is dominated by the pure strategy of always using the 
mean of the mixed distribution, that is, player 1 has no use for mixed strategies. It 
follows that V = max^ rnin^, A(x,y) , where the maximum and minimum are taken 

over whatever sets are feasible forx andy. Similarly, if A(x,y) is a convex function 
of y for each x, then player 2 has no use for mixed strategies and 
V = miUj max^ A{x,y) . If both conditions hold, we have a concave-convex game 

where the max,^ min^ and min^, max^ values are equal, i.e., the game has a saddle 
point. The same arguments turn out to be valid in multiple dimensions. 

Suppose there are n opportunities or areas for player 1 to take action, one of 
which he must choose, and that player 2 must defend against all of them. If player 
1 chooses area i and player 2 has defended that area to the level y„ then the payoff 
is some given convex function Player 2 must defend against all possibili- 

ties, but he has only a limited total budget y, which much be partitioned into the n 
areas. The areas might be drug smuggling routes, opportunities for terrorist attack, 
biological agents, regions of the ocean in which a submarine might patrol, or in- 
dependent routes over which a shipment might be made. This class of games is 
named after the latter logistics games.” If v, is the probability that player 1 
chooses area i, then the average payoff is A{x,y) = '^xJ^{y^). Since this is a 

i 

convex function of y for all x, the value of the game is the same as the min^ max^ 
value - player 2 selects y as if player 1 knows y when selecting x, even though 
player 1 does not really have that privilege. Since selecting x when y is known 
amounts to choosing the area with the largest payoff, this is essentially worst-case 
analysis from player 2’s standpoint. The optimal y is the solution to a minimiza- 
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tion problem with variables y and v, the game value. Specifically, player 2 should 
minimize v, subject to the constraints that y is feasible and yj(y,) <v;i = . 

An optimal x will not emerge from player 2’s minimization, but an optimal x 
nonetheless exists. That is, there is some probability distribution x such that, 
even if player 2 knows x, he cannot find any y such that A(x,y) < v. Player 1 
must randomly select an area from this distribution and keep the selection secret 
from player 2; if he does not keep it secret, then player 2 might discover the 
selected area and allocate all of his resources to it, thus making the payoff 
smaller than v. 

Example 5. Player 1 will choose one of four areas in which to attack player 2. 
There will be no damage if the attack is interdicted, but otherwise the damage 
from an attack in area i is where d = (1,2, 3,4). The noninterdiction probability is 
assumed to be exp(-y,) (this is the random search formula - see Section 7.3.2), so 
the average damage to player 2 if player 1 selects area i is yj(y, ) = d,. exp(-y, ) . If 

player 2 has a total resource y = 3 to distribute to the four areas, how should he do 
so? The solution is on sheet “Logistic” of Chapter6.xls, in the form of a minimiza- 
tion problem (Tools/Solver...) that is solved after the inputs are adjusted. The 
value of the game is 1.06, and the optimal y is (0, 0.63, 1.04, 1.33). Most of the 
defense goes to area 4 where the prospects of damage are greatest. Area 1 is not 
defended at all (nor should it be attacked, since the potential damage there is 
smaller than the game value). If you wish, adjust y and re-solve the game to see 
how different defense budgets are allocated. 

Another example of a logistics game is in Section 8.5.3, where there is an ap- 
plication to a kind of mine warfare. 

Logistic games are the simplest, but not the only application of concave- 
convex games to military affairs. As long as player 1 simply chooses an area, con- 
siderable complication is possible in player 2’s allocation problem. Player 2 may 
assign his forces to “missions,” rather than directly to areas, with each mission 
achieving partial effects in multiple areas. Aircraft assigned to search in one area, 
for example, may have no alternative but to overfly other areas while they are in 
transit. Player 2 can also have multiple kinds of forces, with a separate constraint 
for each. Similar games can also be posed on networks, where one player chooses 
an arc to interdict and the other chooses a path through the network, hoping to 
avoid the interdicted arc. As in logistic games, the solution can in all cases be 
found through optimization (Washburn, 2003b). 



6.2.4 Information and Its Effects on Combat 

One of the most important and difficult problems in modem military analysis is to 
realistically incorporate the effects of information and uncertainty. Game theory is 
by no means a panacea, but it does at least deal with the subject. In this section we 
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discuss the problem in general, as well as the possible uses of game theory in deal- 
ing with it. We will also conclude the discussion of missile allocation that was be- 
gun in Chapter 4. 

Consider the finite matrix game (Oy), and let val{tty) be its value. We can 
also define max^. min^.(ay) , the best that player 1 can get if player 2 knows player 
1 ’s row choice when player 2 selects his column, and min^ max.(ay) , the best that 

player 1 can get if he moves after player 2. These three values can differ substan- 
tially, even though they all share the same payoff matrix. They will always be in 
the order 



min^. max. (a,..) < val{ay) < max^. min^.(ay). (6.1) 

For example, the three values for rock-paper-scissors are -1, 0, and +1, respecttively, 
spanning the range from player 1 losing all the time, to winning as often as he 
loses, to winning all the time. These strong differences are entirely due to the dif- 
ferent information states of the players when they make their decisions. 

The strong dependence of the outcome on information is not confined to 
rock-paper-scissors. Section 9.2.2 includes an analysis of a situation where a 
searcher (player 2) attempts to detect an infiltrator (who is player 1, since his sur- 
vival probability is the objective function) using an unmanned aerial vehicle 
(UAV). The UAV must be within the range of ground stations in order to commu- 
nicate with the controller. We can identify three decisions that need to be made, 
two for the searcher and one for the infiltrator: 

1. Z, the locations of the searcher’s ground stations 

2. X, the search plan for the UAVs 

3. R, the infiltration route 

Let P{Z,X,R) be the infiltrator’s survival probability - this function is specified 
in Section 9.2.2, but its precise nature is not important for the moment. Whether 
the infiltrator actually survives depends on who knows what when, as well as this 
function. We can identify several possibilities: 

1 . A TPZS game where (Z,A) for the searcher and R for the infiltrator 
are chosen in mutual ignorance. 

2. A TPZS game where (Z^ is chosen without knowing R, and then 
R is chosen knowing Z (the ground stations might be visible), but 
not knowing X. 

3. A TPZS game where (Z^ is chosen without knowing R, and then 
R is chosen knowing (Z,A). This amounts to worst-case analysis for 
player 2, the searcher. 

4. A TPZS game where Z is chosen without knowing R, then R is 
chosen knowing Z, but not X, then a spy might discover R and re- 
port it to player 2 (the spy succeeds half the time, say), and finally 
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player 2 chooses X, possibly knowing R and possibly not, depend- 
ing on the spy report. 

5. A TPZS game where Z is chosen without knowing R, then R is 
chosen knowing Z, but not X, and finally X is chosen knowing R 
and (of course) Z. 

There are many other possibilities, some of which incorporate decoys, feints, and 
erroneous measurements. All of these situations are perfectly well-defined finite 
TPZS games, so a solution exists in all cases, but most of them are too compli- 
cated to analyze. To solve game number 1 as a matrix game, we would have to 
begin by listing all of the possibilities for (Z, X), itself a large set, and then calcu- 
late the survival probability for every entry in the payoff matrix. Game 5 is a tree 
game, but a discouragingly large one. Game 4 might be the most realistic of those 
listed, but even describing a strategy for player 2 in that game is a challenge, much 
less listing them all. Of the five games, only the third is easily solved, and that be- 
cause the max-min problem can be cast as a simple optimization, as shown in 
Section 9.2.2. 

The example in Section 9.2.2 is sufficiently small that some of the numbered 
games besides 3 could be solved with sufficient effort, but nonetheless two points 
should be understood. First, when information is involved, it is easy to formulate 
realistic problems that are simply too difficult to solve, in spite of being perfectly 
well defined. As a result, we often end up deliberately solving the “wrong” prob- 
lem as a way to at least gain some insight into the “righf’ one. The solution of 
game 3, for example, is at least a lower bound on the value of game 1. Second, the 
value of a game can be distressingly sensitive to who knows what when. That is, 
we should not lose sight of the fact that the wrong problem that we can actually 
solve has significant differences from the right one. When it comes to dealing with 
information, humility is a good stance for a combat modeler. The next case study 
makes the same point in a different, more important problem. 

JOINT DEFENDER, A Case Study. The USA has some ICBM interception ca- 
pabilities, some of which are anti-ballistic missiles (ABMs) located on ships. 
These assets might be of use in a scenario where a rogue state has chosen to 
launch an ICBM attack on the USA or one of its allies. If there is sufficient politi- 
cal warning of such a possibility, we might consider the question of where such 
ships should be located in order to optimize their capability for frustrating such an 
attack. In addition to providing a warning, the political situation might also iden- 
tify the potential ICBM targets, and even perhaps the value of each target on some 
relevant scale, so we can say that the objective is to deploy the ships so that the 
average total value killed by the hypothetical ICBM attack is minimized. The col- 
lection of ship locations is called X. 

The rogue state (player 1) has some choices to make that also influence the 
outcome. The rogue state is supposed to be sufficiently large in extent, compared 
to the range of its ICBMs, that the location of the ICBMs within its territory is 
significant. Before the attack, the ICBMs may be transported from one place to 
another. The transportation plan is called W. It will also be necessary, of course. 
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to decide which ICBM should be aimed at which target. The aiming plan is called 
Y. 

Although our main purpose in modeling is optimal ship deployment, the ac- 
tual assignment of ABMs to the ICBMs is also relevant to the chosen effective- 
ness measure. The ABM assignment plan is called R. Let F(X,R, W, Y)be the 
average value killed, given all of the decisions represented by the four arguments. 

Chronologically, the decisions happen in the order W, X, Y, and finally R. 
We might therefore expect the object of computation to be 
max^y min^ maXy min^ T’(X,R, W, Y), that is, W is selected knowing nothing 
about X, Y, or R, then X is selected knowing only W, etc. This situation might be 
imagined as a game of perfect information, a very large tree game where the root 
node is a max that branches on W, then each branch goes to a min node that 
branches on X, and so on. But it does not follow from chronology that the mini- 
mizer knows Y when selecting R, since ABMs will have to be committed with 
only partial information about which targets are being attacked, and there is good 
reason for the maximizer to keep Y secret. The actual object of computation will 
have to depend on exactly what is assumed about who knows what when. 

Brown et al. (2005) describe JOINT DEFENDER (JD), a model dealing with 
the situation described above. JD is a state-of-the- art minimization that utilizes 
2005-era computers to their capacity, so the reader might expect that JD will make 
realistic assumptions about the information state of each participant when deci- 
sions are made. That is not the case, as can be seen by comparing reality with JD’s 
assumptions. In reality, movements of both ships and ICBMs are subject to dis- 
guise, so we might suppose that X and W are selected in mutual ignorance, at least 
partially. JD instead gives the advantage to the rogue state, assuming that X, along 
with all ship characteristics such as the number of ABMs aboard, is known when 
W and Y are selected. 

In reality, Y is to some extent known when R is selected, since there will be 
radar and satellite observations of ICBM tracks when ABMs must be committed. 
In a Blotto model of this, we might assume that ABMs are randomly but secretly 
committed in advance to defend certain targets, ignoring all tracking information 
except the part that is required to guide an ABM to its target. JD is even more pes- 
simistic, assuming that the ABM preallocation cannot be kept secret, that is, W 
and Y are selected knowing R. Thus, instead of R being partially informed about 
Y, in JD Y is perfectly informed about R. Again, JD gives the advantage entirely 
to the rogue state. 

JD’s final favor to the rogue state is its assumption that each ABM must be 
committed to defending against an attack by a particular missile on a particular 
target, and will not be launched unless that attack occurs. To emphasize the ex- 
treme pessimism of this assumption, suppose the attacker has one perfect ICBM 
and 10 targets to choose from, while the defender has nine perfect ABMs. JD’s de- 
fender cannot adopt the policy of shooting down the lone ICBM regardless of 
what it attacks, but must instead preallocate each ABM to defending one of the 
targets, and then announce the commitments to the attacker. As a result, the at- 
tacker will always find an undefended target and kill it. 
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What are the analytie advantages of JD’s pessimistie (for the minimizer) as- 
sumptions? The value killed when both sides optimize will be 

min^ min^ max^ maXy F(X,R,W, Y) , whieh ean be written as 

min^ R max^y y F(X, R, W, Y) to emphasize that this is basieally worst-ease 

analysis - the minimizer seleets X and R expeeting the worst possible attaek for 
the seleeted values. The sets of alternatives are mueh larger than in the UAV prob- 
lem diseussed above, but JD employs the same worst-ease eriterion. In JD, the sets 
of alternatives are so large that even the min-max problem is not direetly solvable, 
so it is neeessary to approximate T’(X, R, W, Y) with a linear funetion, after 
whieh the inner maximization ean be replaeed by an equivalent minimization 
based on the dual of a linear program. The problem is thus redueed to finding the 
minimum of something over a large set of alternatives (see Brown et al. (2005) for 
details), a very large but at least eonventional problem that JD ean solve exaetly. 

The JD solution will inelude the optimal X, whieh is the best way of station- 
ing the ships. It will also inelude the optimal R. It would not be reasonable to ad- 
voeate aetually using R in the real world - why artifieially restriet the defender to 
the type of polieies eonsidered? However, it might be reasonable to advoeate us- 
ing X, espeeially if the defender is so preponderant that JD ean guarantee a satis- 
faetory result, in spite of all its pessimism. JD is an advisor in a world where advi- 
sors are searee, so its “optimal” solution is at least worth studying. 

In an ideal analytieal world, we would ineorporate the eorreet expression of 
the uneertainty that surrounds deeisions into all our models and optimize or solve 
games as appropriate. That is usually not possible, so we are foreed to employ ap- 
proximations and artifieialities of the type used in JOINT DEFENDER. We “see 
through a glass darkly,” but it still pays to keep looking. 



6.3 Wargames 



A wargame is a more or less abstraet model of warfare that permits humans to 
play the role of eombatants. There are many forms. Commereial wargames often 
take the form of a playing board representing a part of the Earth on whieh pieees 
representing military units are moved around. Military wargames (and to an in- 
ereasing extent eommereial wargames as well) often rely on eomputers to enforee 
realism in unit movements, to assess damage, and to display whatever parts of the 
game's status that the rules permit eaeh player to see. Some wargames ineorporate 
nonplaying humans (referees) to help eontrol the game's evolution, and some do 
not. Some are very simple, while others are eomplex, detailed, and ean require 
several days to play. 

Most wargames are played for reereation, and there is a signifieant eommu- 
nity of hobby wargamers. Several eompanies publish wargames eommereially, 
and there are even magazines {Fire and Movement, Strategy and Tactics) devoted 
to the aetivity. There has been eonsiderable interplay between hobby and profes- 
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sional wargamers, even though the audienees are distinet. Perla (1990) deseribes 
some of the innovations that both groups have introdueed. 

Professional wargaming (wargaming earried out by or for the military) has 
two major purposes. The paramount purpose is edueation and training. Wargame 
partieipants must make deeisions under stress with ineomplete or even false in- 
formation, and must eope with the eaprieious effeets of luek and enemy aetion. 
These eharaeteristies of warfare are diffieult to simulate in any way other than in a 
dynamie, eompetitive simulation that involves humans. The experienee of partiei- 
pating in a realistie wargame is often very intense, so that lessons learned are long 
remembered. At the end of WWII, Adm. Nimitz stated that 

The war with Japan had been re-enacted in the game rooms 
[at the Naval War College] by so many people and in so many 
dijferent ways that nothing that happened during the war was a 
surprise - absolutely nothing except the Kamikaze tactics to- 
wards the end of the war; we had not visualized these. 

All three US serviees, and most other military organizations, utilize wargaming as 
an edueational deviee. 

The teaehing power of wargaming has its dark side. If the stmeture of the 
game being played refleets politieally eorreet assumptions that are not realistie, 
then the lessons learned may be useless or even worse. Braeken (1976) states that 
the initial invasion of Russia in 1941 was so sueeessful beeause Stalin had deereed 
that the only taetie that eould be ineluded in wargames (both sides wargamed the 
invasion beforehand) was a defense of the border. The Soviet Union eventually 
perfeeted the defense-in-depth taeties that were aetually needed, but the early eost 
in terms of eaptured divisions was terrible. Also, wargames will inevitably ignore 
eertain aspeets of reality, and to some extent will thereby teaeh that those aspeets 
are not present in the real world. Fear and panie are not experieneed in wargames, 
for example. Most wargames in the United States are earried out using eommuni- 
eations in English, where nobody runs out of water or toilet paper. That may not 
aetually be the ease. 

Wargaming ean be used for analysis as well as edueation and training. Given 
the eapability to fight abstraet wars, it is tempting to test the effeet of different 
weapons or taeties on the wargaming outeome, perhaps basing proeurement deei- 
sions on results. The tendeney to use wargaming in this manner has fluetuated in 
the past. Wargaming's utility as an analytieal tool is not as obvious as in its use in 
edueation, so the teehnique will probably eontinue to eome in and out of favor. 

The major reeommendation for using wargaming as an analytie tool is that it 
gets around most of the diffieulties mentioned earlier in diseussing game theory. 
Game theory requires that payoffs be explieit funetions of the strategies. 
In wargaming, on the other hand, one ean argue that the payoffs eome with the 
baekgrounds of the players, so that a brief statement about goals (“Take Stalin- 
grad, but whatever you do don't Third Panzer Division”) will suffiee. In game 
theory, the need to initially list all possible ways in whieh a battle might evolve 
is often so daunting as to either inhibit the exereise or lead to oversimplifieation. 
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For this and other reasons, abstract games are often unsolvable, while questions of 
solvability do not arise in wargaming because “optimal” tactics are not sought. 
Since wars are fought by humans, it makes sense to use human players in emulat- 
ing combat, whatever their weaknesses and motivations. If the results reflect those 
weaknesses and motivations, then that, the argument goes, is as it should be. 

The counterarguments are of two kinds, the first having to do with motiva- 
tion. A wargame is only a more or less accurate abstraction of the world. Players 
sometimes express a felt conflict between making decisions as they would be 
made in the real world and making decisions in order to do well in the abstraction. 
These decisions may differ on account of readily perceived modeling deficiencies. 
For example, the scoring of a wargame may not be affected by casualties incurred, 
but only by success in achieving an objective, whereas casualties are always of 
concern in reality. How should those players be advised? The answer is not obvi- 
ous, and the resulting murkiness does not augur well for making decisions about 
tactics or equipment based on the results of play. Player attitudes toward risk are 
particularly worrisome; there is a difference between having a blip labeled “CVN” 
disappear from a computer screen and having an actual aircraft carrier sunk. In 
games where there are possibilities for coalition formation or cooperation, it is 
questionable whether the feelings about honor and guilt that would accompany 
certain actions in the real world can be reproduced in an artificial situation. 

The second counterargument raises technical objections about using wargam- 
ing as a research technique. The idea that it is best to capture decision making that 
is typical, rather than optimal, is particularly questionable in research games that 
represent an unfamiliar reality to the players. There is a natural tendency to over- 
state the effectiveness of new weapons or tactics when played against an inexpert 
enemy who has yet to develop countermeasures. The enemy might very well de- 
velop some tactical countermeasures in the process of repeatedly playing the 
wargame, but wargaming is sufficiently time consuming and expensive that re- 
peated play of the same game by the same players is rare. To some extent this lack 
of repetition is deliberate, since research wargamers often try to avoid player 
learning because they fear that the learning will be about the wargame ’s artificiali- 
ties. As a result, questions of statistical validity are seldom dealt with. Deliberately 
seeking enemy play that is typical, rather than optimal, violates a well-known 
military maxim by designing to the enemy’s intentions instead of the enemy’s ca- 
pabilities. Finally, typical human play often differs substantially from optimal 
play in TPZS games for which the solution is known (Kaufman and Lamb, 1967). 
Thomas and Deemer (1957) downplay the usefulness of wargaming as a tool for 
solving TPZS games, although they admit its usefulness for getting the “feel” of a 
competitive situation. Berkovitz and Dresher (1959) state bluntly that “In our 
opinion ... [wargaming] is not a helpful device for solving a game or getting sig- 
nificant information about the solution.” 

The counter-counterargument, of course, is that, whatever the objections to it, 
research wargaming is often the only feasible technique for examining a genuinely 
competitive situation. Theoretical and experimental approaches each provide in- 
sights into combat, but a great deal of uncertainty must remain. 
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Exercises 



(1) In Section 6.2.1, the distribution y = (4/19. 15/19) is claimed to have 
the property that both of player I’s strategies lead to a kill probability 
of 7/38. Show that this is the case. 

(2) In Section 6.2.1, it is announced that player I’s optimal strategy 
X = (3/19,16/19) can be derived by equalizing the average payoff for 
both of player 2’s strategies. Show that this is true by equating the 
payoff for both SH and DP to the unknown game value v. Solve the re- 
sulting equations for x and v. 

(3) Rock-paper-scissors is a TPZS game with a circular domination rule: 
Rock beats scissors beats paper beats rock. Model it as a 3x3 game 
where the payoff is the net number of times that player 1 wins (1 if his 
choice dominates, -1 if his choice is dominated, otherwise 0). Show 
that the value of this game is 0, and that the optimal mixed strategy for 
both sides is (1/3, 1/3, 1/3). 

(4) Reconsider Example 2. Give Blotto four regiments, instead of three, 
and solve the resulting game. 

Ans. Blotto and his enemy are each equally likely to choose any strat- 
egy, and the game value is 0.8. 

(5) Solve the following 4x5 example of a guessing game. The strategies 
might be cells in which player 2 might hide, or cells in which player 1 
might search 
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Ans. x = (0.17, 0.34, 0.06, 0.42), y = (0.29, 0, 0.29, 0.43, 0), and 
v = 0.857. 



(6) Consider the logistics game where n = 3, and 
/i(>’i) = 3/(l + y,), / 2 (P 2 ) = 2/(1 + >’ 2 )> and/ 3 (>’ 3 ) = l/(l-l-5>’3). Using 
sheet “Logistics” of Chapter6.xls as a model, construct a spreadsheet 
that uses Solver to find the best way for player 2 to partition a budget 
ofy over the three areas, withy being an input. Ify = 2, what is the best 
y, and what is the value of the game? 

Ans. y = (1.4, 0.6, 0), and v = 1.25. 
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(7) Consider a Blotto game where there are six attackers, eight defenders, 
and four targets. All attackers and defenders are perfect, so each target 
will be killed if and only if the attackers outnumber the defenders. The 
payoff is the average number of targets killed. 

(a) What are the max-min and min-max values? 

(b) Consider the following tactic for the defender. Defend the targets 
in pairs, with each pair getting four defenders. Within each pair, assign 
a number of defenders to one target that is equally likely to be any 
number between 0 and 4, and the rest of the defenders to the other tar- 
get. This tactic guarantees that the average number of targets killed 
will not exceed . . . what? 

Ans. The max-min value is 0, and the min-max value is 2 (achieved 
by defending each target with two defenders). In part b, the suggested 
randomized tactic guarantees that not more than 1.2 targets will be 
killed on the average. That strategy turns out to be optimal for the de- 
fender, that is, the value of the game is 1.2. 




Chapter 7 
Search 



If Edison had a needle to find in a 
haystack, he would proceed at once 
with the diligence of the bee to exam- 
ine straw after straw until he found 
the object of his search... I was a 
sorry witness of such doings, know- 
ing that a little theory and calcula- 
tion would have saved him ninety per 
cent of his labor. 



Nikola Tesla (1931 ) 



7.1 Introduction 

First of all, the subject of this chapter is not searching on the internet. Nor is it 
searching for a job or inner peace, and it has nothing to do with data mining. 
These initial caveats are necessary because practically all intellectual activity can 
be thought of as a search for something, so the term “search” by itself implies very 
little. It is an outright mistake to say “search” to a search engine, and even “search 
theory” will expose many entries having nothing to do with what we have in mind. 

This chapter is about searching for physical objects with sensors that act very 
much like our eyes and ears. Eyes sense electromagnetic radiation, as do radars, 
lasers, and cameras. Ears sense sound vibrations, as do microphones and sonars. The 
fundamental phenomena in each case are such that signal strength decreases rapidly 
with distance from the source, so physical proximity is the primary requirement for 
detection. Details about specific sensors can be found in books such as Skolnik 
(2001) for radar or Urick (1996) for sonar, but will not be needed here. The only im- 
portant point for our purposes is that proximity is required for detection. 

It is hue that we can detect stars over astronomical distances, but that is only be- 
cause they are very shong sources exposed against a background with very lihle noise. 
Targets on Earth, especially those of military interest, are usually not so cooperative, 
so searching often involves motion as the searcher tries to achieve the required prox- 
imity. The searcher will frequently move from place to place or hope that his target 
does so. Roughly speaking, search is a sequence of repeated trials where successes 
happen when the distance between searcher and target is short enough. 

Part of the reason for the increasing importance of search is that targets have 
an increasing tendency to be hidden. During the Vietnam War, the US Air Force 
spent 800 aircraft sorties and lost 10 aircraft in futilely trying to destroy the Thanh 
Floa Bridge near Flanoi. After the introduction of laser-guided bombs, the same 
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bridge was destroyed with only four additional sorties (USAF, 2005). The inereas- 
ing lethality of preeision-guided weapons has made it unsafe to be a well-loeated, 
obviously hostile target, so it should not be surprising that targets of military inter- 
est are inereasingly hard to loeate and identify. 

Some forms of warfare have always been nearly pure seareh problems. The 
effeetiveness of a minefield depends very mueh on the loeations of the eonstituent 
mines being unknown, so elearing a minefield is essentially a seareh problem. The 
whole point of being a submarine is to (nearly) prevent the sueeessful operation of 
eleetromagnetie sensors, thus foreing a diffieult seareh problem on antisubmarine 
forees. 

The great questions in seareh all involve time. We ask, “How long will detee- 
tion take?” or “What is the probability of deteetion in a fixed time?” Deteetion is 
inevitable, given suffieient time. The objeet of seareh planning is to speed things 
up. 

Seareh effort may either be applied eontinuously, as when a man walks 
around on the sidewalk looking for his ear keys, or diseretely, as when an aireraft 
drops a sequenee of sonobuoys, eaeh of whieh examines a eertain seetion of 
oeean. In the diserete ease, seareh theory has mueh in eommon with the firing the- 
ory of Chapter 2. In both eases a sequenee of diserete attempts (shots in Chapter 2, 
looks in this ehapter) is made to aehieve a sueeess (kill in Chapter 2, deteetion in 
this ehapter), so the mathematies of Chapter 2 ean sometimes be applied to seareh 
problems. 

Example 1: A submarine has been loeated within a loeation error that has a eireu- 
lar normal distribution with standard deviation 500 m. An aireraft has four reliable 
sonobuoys available, eaeh of whieh has a deteetion radius of 200 m, and eaeh of 
whieh ean be delivered to any aim point, exeept that eaeh sonobuoy delivery has a 
eireular normal delivery error with standard deviation 100 m. In what pattern 
should the four be aimed, and what is the resulting deteetion probability? To ap- 
proximately answer this question, we might employ the diffuse Gaussian analysis 
of Seetion 2.3.3, with its aeeompanying sheet “DGPattn” of Chapter2.xls. The 
area eovered by eaeh sonobuoy is 7 t( 200)^. Sinee the lethal area of a diffuse Gaus- 
sian weapon is Inb^by, we equate the two areas to find that =by =141.4 m 

for 1 = 1,2, 3, 4. The neeessity of equating lethal areas is what makes this analysis 
an approximation. Sinee the delivery errors are eireular, take (7^ =Oy = 100 m for 

i = 1, 2, 3, 4. Sinee the submarine’s loeation error is eireular and eommon to all 
shots, take Gjj =Oy = 500 m . If this data are entered onto sheet “DGPattn”, using 
a reliability of 1 for eaeh shot beeause the sonobuoys are all assumed to funetion 
as designed, and if the aim points are then optimized using Solver, the result is 
that the four sonobuoys should be aimed in a square of side approximately 416 m, 
and the assoeiated deteetion probability is 0.22. This is referred to as “Pa:” in 
Chapter 2, but in the eurrent applieation it is a deteetion probability. 
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When search is performed continuously, rather than discretely, we need a dif- 
ferent notion of equivalency than the lethal area of Chapter 2. This is the notion of 
sweep width, developed in the next section. 



7.2 Sweep Width 



The simplest description of a sensor is to give its detection range R, the distance 
within which detection will surely happen, or else surely not happen if the distance 
is larger than R. We might say of the unaided eye that its detection range for a tank 
in open terrain in daylight is one mile. Unfortunately, while a fixed detection 
range satisfies our desire for a simple, proximity -based measure of a sensor’s per- 
formance, it is frequently too crisp an idea. The eye’s detection range might de- 
pend on the tank’s speed and camouflage, the turret orientation, the platform the 
eye is mounted on, the presence of smoke or haze on the battlefield, and other fac- 
tors. The idea of a fixed or “cookie-cutter” detection range is so appealing that we 
will continue to use it, but attempts to forecast fixed ranges in the real world are 
often disappointing. Forecast detection ranges for sonars are notoriously subject to 
error - it is not uncommon to be off by a factor of two or more. 

In Chapter 2, a similar indeterminacy problem was solved by introducing the 
damage function D{r), whose meaning was the probability of kill if the weapon 
detonates at range r from the target. We then defined a notion of equivalency 
based on lethal area, as employed in Example 1 above. That tactic will not work 
for continuously moving sensors. The problem is that the notion of a “trial” is not 
well defined. If we let D(r) be the detection probability “af ’ range r, we will then 
have to deal with an infinity of closely spaced trials, since the sensor looks con- 
tinuously. Detection will be nearly certain if all those trials are assumed to be in- 
dependent. Rather than try to deal with such difficulties, we instead proceed by 
defining a basic search experiment that may or may not result in detection. This is 
the approach originally used in World War II by the Operations Evaluation Group 
(OEG, 1946). 

The basic experiment is as illustrated in Figure 1. Relative motion between 
sensor and target is assumed to be at constant velocity in two dimensions, charac- 
terized by a lateral range. 

Definition: If relative motion between sensor and target is assumed to be at con- 
stant, nonzero velocity in two dimensions over a complete line, the lateral range 
is the distance between searcher and target at the point where the distance is 
minimal, taken to be positive if the target lies to the searcher’s left at the minimal 
point or otherwise negative. The minimal point is also called the point of closest 
approach. 
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Figure 1: Illustrating a top view of the basic experiment on the left, shown 
with a positive lateral range, and an asymmetric lateral range curve on the 
right. 

If the experiment of moving over a complete line is carried out, it may or may 
not result in a detection. We do not attempt to keep track of the time or “true 
range” where the detection occurs; it may even occur after the point of closest ap- 
proach. This might happen if, for example, the searcher’s eyes no longer have to 
cope with glare from the sun once he is past the target. Any detection at any point 
on the line constitutes a success for the experiment. Let P{r) be the detection 
probability at lateral range r. 

Definition: A graph of detection probability P{r) versus lateral range r is a lateral 
range curve. The area underneath the lateral range curve is called the sweepwidth. 

Note that a sweepwidth has dimensions of length in spite of being an “area,” 
since the vertical dimension of a lateral range curve is dimensionless. 

Lateral range curves are usually symmetric about 0, unlike the curve shown in 
Figure 1, so it is not usually necessary to remember the left/right sign convention 
for lateral ranges. 

Now imagine that search consists of a series of “passes,” each of which 
can be thought of as movement over a whole line with its own lateral range. 
The lateral ranges themselves will be random, since the location of the target 
is unknown. Let f(r) be the density function of the lateral range for some pass. 
By averaging over the lateral range, we obtain the probability of detection Pd 
for that pass: 




(7.1) 



where W is the sweepwidth. The approximation in (7.1) is accurate as long as the 
density f(r) is nearly constant over the interval where P(r) is significantly positive. 
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in which case the density can be factored out of the integral. This will usually be 
the case when lateral ranges tend to be much larger than the sweepwidth; that is, 
when each pass results in a small detection probability. The integral in (7.1) is 
then the area under the lateral range curve or the sweepwidth. In this sense, the 
sweepwidth is all we need to know about the sensor. If we have to state a single 
number that describes the effectiveness of a continuously moving sensor, the 
sweepwidth W is the most natural one to give. 

If we actually had a definite range sensor that would detect targets inside 
range R, but not otherwise, then the lateral range curve would be 1 if for |r| < R, or 
otherwise 0, and the sweep width would be 2R. We thus have a notion of equiva- 
lency, albeit a different one than the notion of Chapter 2. A sensor with sweep- 
width W is equivalent to a sensor with definite detection range W/2, as long as 
search is of the type for which (7.1) correctly represents the detection probability 
per pass. The last caveat makes equivalency a risky assumption in many of the 
situations where we would like to employ it, but sweepwidth is still a valuable no- 
tion. 

Sweepwidths can be measured by repeatedly performing the basic experi- 
ment. The U.S. Coast Guard has great interest in the detection of marine targets 
from both air and sea platforms and has performed numerous experiments whose 
ultimate goal is to estimate sweepwidth (e.g.. Robe et ah, 1985). Tables of visual 
sweepwidths for marine targets are available (e.g., NSARC, 2000). 

Sometimes it is the target that moves, rather than the sensor, but the same 
theory applies. The interaction between stationary naval mines and their moving 
ship targets is usually imagined to be as in the basic experiment, effectively sum- 
marized by recording a lateral range curve and its corresponding sweepwidth. 
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We now consider the search of a fixed region with area A that is known to contain 
a single stationary target, with no prior knowledge of the target’s location. A sin- 
gle searcher looks continuously for the target, moving at speed V and with fixed 
detection radius R. The assumption that no prior knowledge is available means 
that the target’s location density is constant (necessarily 1/A) within the region. 
We might have a ship looking for a lifeboat, a tank looking for its next target, a 
satellite looking for a particular truck, or a man trying to find his car keys. The 
important parameters are A, V, R, and the amount of search time t. Depending on 
how the search starts, one might argue that a circle of radius R is covered initially 
and instantaneously, but we will make no allowance for any resulting initial detec- 
tions in the following. We will refer to the sweepwidth W = 2R, rather than R, 
even though our initial analysis concerns only a fixed detection radius. 

After a time t, the searcher will have covered a region of length Vt and width 
W, the area of which is VWt. The ratio of this area to the area of uncertainty A 
plays a central role. 
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Definition: The coverage ratio z is the ratio VWt/A, the ratio of the area eovered 
to the area of uneertainty. 

We are usually interested in the deteetion probability as a funetion of time, 
whieh we eall Po(i). In problems where seareh eontinues until the target is found, 
let The the random time until the first deteetion. Sinee = P{T < t ) , the mean 
value of T is given by 

E{T) = j P(r > t)dt = j (1 - {t))dt. (7.2) 

0 0 

For nonnegative random variables like T, the first equality of (7.2) ean be shown 
to be true using integration by parts. Sinee the mean time to deteetion is the area 
underneath a graph of the nondeteetion probability, it will be small when the de- 
teetion probability is large. 



7.3.1 Exhaustive Search 

The best that the seareher ean hope for is that the area VWt is all loeated within the 
region to be searehed and does not overlap with itself. In that ease, sinee the target 
is equally likely to be anywhere in a region of area A, the deteetion probability is 
VWt! A, the eoverage ratio. More preeisely, sinee the deteetion probability F’o(0 
eannot exeeed 1, we have 

(0 = min(l, z), where z = VWt /A . (7.3) 

Formula (7.3) is aetually an upper bound, sinee the seareher eannot hope for 
better. If the seareher ean navigate aeeurately and does not need to turn too often 
while staying inside the region of uneertainty, the formula may be aeeurate. A 
seareher attempting to aehieve an exhaustive seareh might follow a path that looks 
something like the path of a lawn mower; indeed, exhaustive seareh is sometimes 
referred to as a “lawn mower seareh” for that reason. The seareher might also em- 
ploy a spiral of some kind in attempting to eover all of a eireular area without 
overlap. 

The maximum time to deteetion is A/(VW) in an exhaustive seareh, and the 
mean time to deteetion is easily shown to be half of that (Exereise 1). 
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7.3.2 Random Search 

A different kind of approximation is to assume that the covered area will overlap 
significantly with itself, with the amount of overlap being whatever happens “at 
random.” More precisely, we assume that the covered area VWt is in effect shred- 
ded into n pieces of confetti before being distributed uniformly over the region of 

VWt 

interest. Each piece of confetti has a chance of of covering the target. Since 

nA 

each piece of confetti is assumed to be an independent trial, we can use the power- 
ing-up idea of Chapter 2 to conclude that the overall detection probability is 

VWt 

Po (0 = 1 - (1 )" s 1 - exp(-z), where z = VWt! A. (7.4) 

nA 

The approximation symbol in (7.4) means that n is irrelevant as long as it is large, 
with 1 - exp(-z) being the limit as n approaches infinity. It does not matter exactly 
how many pieces of confetti there are, as long as there are lots of them. The im- 
portant thing is that the total amount of area is effectively cut up into confetti. 

Formula (7.4), the random search formula, is the most widely applied formula 
in search theory, but it seems to correspond to a rather unlikely idea. Why would 
anyone want to cut his search area up into confetti, even if he could? The justifica- 
tion for the wide use of (7.4) is not that anyone would deliberately search ran- 
domly, and certainly not that there is any physical analog to scattering confetti in a 
real search, but rather that many of the difficulties placed in the way of real 
searches have the same randomizing effect. Real searches will have overlap, either 
because of the necessity to turn or because navigation is imperfect. Targets some- 
times move slightly as search proceeds, even if they do not intend to, thereby 
spoiling the perfection of a lawn mower pattern. An exhaustive search can only be 
performed with a definite range sensor, whereas most sensors have indefinite 
fields of coverage that force a certain amount of overlap. The effect of all of this is 
that the intended exhaustive search frequently gets turned into something that is 
not far in effect from scattering confetti. 

When t is very small, formulas (7.3) and (7.4) agree that the detection prob- 
ability is approximately (VW/A)t. The coefficient (VW/A) can be thought of as a 
constant detection rate ^ in a Poisson process of detections, with (7.4) being the 
probability that there are no detections over an interval of length t. The reader who 
is familiar with the wide applicability of Poisson processes may consider that ob- 
servation to be a better derivation of (7.4) than the confetti analogy. The character- 
izing assumption of such processes is that events in nonoverlapping intervals of 
time are independent. This is the real difference between (7.3) (which assumes 
that past failure ensures future success through a process of elimination) and (7.4) 
(which assumes that the chance of success in the next interval of time is independ- 
ent of what has happened in the past). 
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According to (7.4), the time T to detection in random search is an exponential 
random variable with parameter X, and its mean is therefore A/iVW), the recip- 
rocal of X. Thus E(T) is exactly twice as large as in exhaustive search. The time 
T lacks memory in the sense that P(r >t + x\ T >t) is independent of t; that is, it 

does not matter how long you have aheady been searching unsuccessfully if you are 
trying to predict when detection will finally occur (Exercise 2). In a sense, search- 
ing at random feels very much like spinning a roulette wheel until “00” finally 
shows up. The wheel cannot remember the past, and neither can a random search. 

Figure 2 shows both formulas (7.3) and (7.4), in addition to a third formula 
that will be introduced in the next section. All three curves start out with the same 
slope (namely X), but otherwise differ significantly. They differ the most when the 
coverage ratio is 1, exactly the amount of time that a searcher who believes he is 
making an exhaustive search would commit to the task. 




exhaustive 

random 

inv cube 



Figure 2: Detection probability versus coverage ratio for exhaustive search, 
random search, and the inverse cube law. 
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Even though searching randomly is not usually the goal, but simply a skepti- 
cal analytic concession to reality, there are circumstances where a feasible method 
of actually performing a random search can be useful. These circumstances often 
involve moving targets (Section 7.6) and may involve robots who need to be pro- 
grammed to move in some specific manner. The default way of accomplishing this 
is“diffuse reflection.” In diffuse reflection, the path of the searcher has the same 
statistical properties as the path of a light photon making diffuse reflections from 
rough walls, hence the name. Consider any convex, two-dimensional region such 
as a circle or rectangle. Begin anywhere, choose any direction and move at fixed 
speed in a straight line until the boundary is contacted. After reflecting from the 
boundary, move at fixed speed in a straight line until the boundary is contacted 
again, and so on. Each reflection from the boundary should be at a random angle 9 
measured from the local tangent to the boundary, and the density function of this 
angle should be 0.5sin(^) for 0 < ^ < ;r . Note that this density is not uniform in 
diffuse reflection. Figure 3 shows the kind of searcher track that results. 




Figure 3: Illustrating diffuse reflection. All movement is at fixed speed and 
the random reflection angle has density O.5sin(0), independent of all other re- 
flection angles. 
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Lalley and Robbins (1987) show that diffuse reflection has most of the prop- 
erties that we expect of a uniformly random search. For example, the region is 
covered uniformly in the sense that, regardless of the shape of the boundary, the 
amount of time that the searcher spends in any subregion depends only on the area 
of the subregion, not its shape or location. If for any reason you should want to 
implement random search, consider diffuse reflection. If you are tempted by 
something else, then be aware that many kinds of random motion will not cover 
the region uniformly. 



7.3.3 The Inverse Cube Law 

The inverse cube law of detection was invented in World War II to deal with 
situations where a lookout is trying to detect the wake of a ship (the wake of a fast 
moving ship is much more visible than the ship itself). The solid angle subtended 
at the eye by such a wake decreases inversely with the cube of the range to the 
wake as long as the altitude of the eye above the plane of the wake remains con- 
stant, hence the name of the law. The essential assumption is that target detection 
is a Poisson process where the event rate is proportional to that solid angle, with 
the proportionality constant depending on such things as the target’s size, its con- 
trast with the background, and other parameters. The sensor’s sweepwidth de- 
pends on this proportionality constant. 

Consider a lawn mower search of an area where the searcher’s parallel passes 
are separated by a track spacing S. The resulting coverage ratio is WIS, where W is 
the sweepwidth. The exact formula for the inverse cube law detection probability 
requires several pages to derive (see OEG, 1946 or Washburn, 2002), so we will 
not reproduce it here, but the final formula is 

Pj, (t) = 20(z^) - 1, where z = W/S. (7.5) 

Equation (7.5) is the middle curve in Figure 2. Since the track spacing possible 
when searching a fixed area A at speed V in time tisS = A/(Vt), the definition of z 
in (7.5) is really the same as in (7.4); i.e., the coverage ratio. 

The appearance of the cumulative normal distribution 0() in (7.5) is an ana- 
lytic mystery. No random variable has been assumed to be normal, and yet the 
cumulative normal function appears! The function is widely tabulated, but cannot 
be expressed in terms of more elementary functions. When z= 1, the detection 
probability is 0.79, well below the detection probability for exhaustive search, but 
well above the detection probability (0.63) for random search. 

In addition to its derivation as the natural function to use when detecting 
wakes, the inverse cube law is of interest because it seems to be a good compro- 
mise between the optimism of exhaustive search and the pessimism of random 
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search (see Figure 2). Most current interest derives from the latter fact. The Inter- 
national Aeronautical and Maritime Search and Rescue Manual (ICAO, 2003) 
asks the search planner to decide whether search conditions are “ideal” or “poor” 
for purposes of calculating detection probabilities, interpreting the former as the 
inverse cube law and the latter as random search. The manual does not employ the 
exhaustive formula. Sweep width is obtained from a table, so the planner’s choice 
of the track spacing S determines the coverage ratio and hence the detection prob- 
ability. 



7.4 Barrier Patrol 



In this section we assume that the searcher uses a cookie -cutter sensor with defi- 
nite range R. The definite range assumption is essential, since the searcher will 
usually follow a highly nonlinear track that contrasts sharply with the basic lateral 
range experiment of Section 7.2. 

Searchers sometimes take advantage of environmental features that force 
moving targets through narrow restrictions. These restrictions might be straits or 
harbor mouths at sea or on land they might be mountain passes, watercourses, or 
(if targets are in wheeled vehicles) roads. By remaining in the vicinity of these re- 
strictions, the searcher hopes to construct a one-dimensional barrier to movement, 
rather than deal with a large, two-dimensional expanse. Patrolling near Gibraltar, 
for example, was a tactic used to prevent U-boats from entering the Mediterranean 
in World War II. 

Let L be the width of the barrier to be protected. We assume that target mo- 
tion is perpendicular to the barrier. Consider a target whose speed is U and assume 
that the barrier penetration point is uniformly distributed over L. If 2R is larger 
than L, then a single, stationary sensor in the middle of the barrier is sufficient. 
Otherwise, let the searcher’s speed be V. The searcher must remain in the vicinity 
of the barrier, so his track will consist of repetitive traverses of a closed curve (see 
Figure 3). What kind of closed curve is best, in the sense of producing the highest 
detection probability? 

The answer to the last question is unknown, but there is good reason to sup- 
pose that a particular upper bound is actually a good approximation to the best 
possible detection probability. To derive it, first imagine that there are actually 
many targets uniformly distributed over a tape that moves past the searcher at 
speed U. A point on the tape is “covered” if it ever comes within distance R of the 
searcher, and the searcher’s goal is to cover the largest possible fraction of the 
tape. This is a simple rephrasing of the question, since the fraction of the tape cov- 
ered is the same as the probability of detecting any particular target on it. 

If ^is the angle between the two velocity vectors, then the searcher’s speed 

relative to the tape is 5 = +V^ - 2UV cos(0) . The relative speed S will vary 

with time because 0 varies with time as the searcher moves around the closed 
curve. Flowever, since the average value of cos(^ must be 0 because the curve is 
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closed, the average value of S (symbolically E(S)) cannot exceed \lu^ + . This 

follows from Jensen’s inequality and the fact that the square root is a concave 
function. Now, new tape area shows up at the rate UL, but the searcher can- 
not possibly examine it faster than the rate 2RS at any time or 2RE{S) on the aver- 
age. The ratio of these two rates is therefore an upper bound on the detection 
probability: 






2RE{S) 

UL 




(7.6) 



Of course, Pd must also be smaller than 1 . 




Figure 4: A searcher following a figure-8 type track at speed V hopes to de- 
tect a ship moving at speed U as it tries to penetrate a barrier of width L. 

Referring to Figure 4, if the searcher were to move entirely in the East-West 
direction, his average speed relative to the tape, which is moving West at speed U, 
would be V, the same as if he never moved at all. If the searcher instead moves in 
the North-South direction, he achieves the preferred dynamically enhanced speed 

of , relative to the tape. The trouble is that he must reverse course when 

he nears the North or South border in order to keep his sensor on the tape, and this 
course reversal results in wastefully covering parts of the tape that have already 
been covered. Use of a figure-8 track, instead of a straight back- and- forth track, to 
some extent achieves a course reversal without double coverage, but only par- 
tially. Use of (7.6) amounts to the assumption that the searcher can find some way 
of achieving dynamic enhancement without suffering much from wasteful cover- 
age of areas outside the tape or wasteful double coverage of areas on the tape. To 
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put it another way, use of (7.6) amounts to pasting the two tape edges together, 
which would allow the searcher to instantaneously jump from one edge to the 
other. If the edges were pasted together, the searcher could have dynamic en- 
hancement all the time, without redundant coverage, by continually circling 
around the tube formed by the pasted tape. 

It turns out that the figure-8 class of tracks is large enough that the searcher 
can usually find some way of nearly achieving the detection probability of (7.6). 
Even the class of bow-tie tracks (symmetric tracks with four straight edges, two of 
which are parallel to the channel boundary) is large enough. OEG (1946) consid- 
ers only two bow-tie tracks, one of which is the special case of back-and- forth pa- 
trol. Washburn (1976) generalizes and finds that, in most cases, the bow-tie class 
is sufficiently large to include a member whose detection probability is close to 
(7.6). 

Example 2: Suppose U= V and W=LH. Formula (7.6) gives 0.707 as an upper 
bound on detection probability. Exhaustive examination of all bow tie tracks with 
straight edges where the detection circle just grazes the channel boundary at turns 
gives a detection probability of about 0.69, achieved with a bow tie whose central 
legs make an angle of about 8 1 degrees with the channel boundary. If that angle 
were 90 degrees, we would have a back- and- forth track with a detection probabil- 
ity of 0.68. A wider selection of tracks, including tracks that slightly overlap the 
channel boundary at turns, would produce a slightly greater detection probability 
than 0.69, but surely not greater than 0.707. 



7.5 Optimal Distribution of Effort for Stationary Targets 



The greater part of search theory does not seek optimal tracks, as we did in the 
section above. The main reason for this is that, as analytical objects, optimal tracks 
are hard to deal with. A secondary reason is that most searchers do not want to be 
told exactly what to do, anyway. These problems may disappear in the future, 
given the increasing capabilities of computation, robots, and navigation systems. 
Kierstead and DelBalzo (2003), for example, get encouraging results by applying 
a genetic algorithm to track optimization. Nonetheless, in this section we follow 
historical precedent in seeking to give only rough guidance to the searcher in his 
quest for the target, rather than precise directions about exactly what track to fol- 
low. We assume that there is only one constraint on searching, essentially the 
amount of time available for search. For generality we refer to this constraint as 
“effort.” The problem is then to advise the searcher how he should optimally dis- 
tribute his effort over the various places where the target might be. 

Why not just distribute the effort uniformly? One reason is that the target may 
be more likely to be in some places than others. Another reason is that the searcher 
might be more effective in places like meadows than in other places like swamps. 
Both of these reasons lead to the idea of partitioning the overall search area into 
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“cells” where conditions are uniform. The problem is then one of distributing ef- 
fort among these cells. 



7.5.1 Discrete Effort 

Here we assume that effort is measured in discrete quanta called “looks.” One 
look might correspond to an aircraft sortie, to the amount of searching that a dog 
team can do in 1 day or to an arbitrary time period like 1 hour. The cells are num- 
bered from 1 to n. The geometric arrangement of the cells does not matter because 
the target is assumed not to move, and because we assume that any effort spent by 
the searcher in traveling between cells is negligible. We also assume that the 
searcher has no chance of detection unless he looks in the correct cell; that is, we 
ignore the possibility that the searcher might find the target in one cell while look- 
ing in another. We need two kinds of probabilities: 

Definitions: The occupation probability for a cell is the probability that the target 
occupies it, given the history of the search so far. The overlook probability for a 
cell is the probability that a look in the cell will not find the target, even when the 
target is there. Overlook probabilities are assumed to be given data, not depending 
on the search plan. 

Let Pi be the initial occupation probability for cell i. The corresponding vector 
p is the initial distribution for the location of the target. This probability distribu- 
tion in an input; that is, anyone desiring to use the results of this section will have 
to quantify the initial uncertainty about target location. Richardson and Stone 
(1971) recount the accomplishment of this in the search for the lost nuclear sub- 
marine Scorpion, where cells corresponded to a fixed grid on the ocean. Given the 
communications history, the existence of a seamount in the area, and conflicting 
theories about how the submarine met its demise, each cell in that grid was even- 
tually assigned a probability. 

Let qi be the overlook probability for cell ;, and q the corresponding vector. 
These overlook probabilities might be obtained from one of the formulas of 
Section 7.3, or might be direct estimates from experts. They could in principle 
depend on time, although that is not the case here. We assume that all looks are 
independent, and so, for example, three looks in cell i have an overlook probabil- 
ity qf This assumption should not be accepted casually, since its verity depends 
on the nature of the search in each cell. Suppose that a “look” corresponds to the 
assignment of a human searcher to a particular grid cell for 4 hours, and suppose 
that searchers are methodical people who spend their 4 hours making an exhaus- 
tive search of half of a cell. The overlook probability for one look is therefore 0.5. 
What is the overlook probability for two looks? We would like it to be zero, since 
two looks is enough time for exhaustively searching the whole cell, but the inde- 
pendence assumption would have it be 0.25. The latter might actually be more 
valid, depending on how the search is organized. Perhaps searcher A1 cannot 
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remember what he did the first time when he is once again assigned to look at the 
cell or perhaps the first look is Al’s and the second is Moshe’s, and the two do 
not (or cannot) communicate. This kind of forgetfulness or “friction” might be 
used to justify the independence assumption. If the two looks are each random 
searches of the whole area, rather than partial exhaustive searches, then of 
course the independence assumption is automatically true. In the rest of this sec- 
tion, we assume that the independence assumption holds. 

If the total number of looks in cell i is then it follows from the above as- 
sumptions and the theorem of total probability that the overall miss probability is 



Since x=(xi,...,x„) is under the searcher’s control, except for being nonnegative 
and the requirement that the total number of looks is given, the mathematical 
problem is to minimize (7.7) with x, subject to the constraint that only a certain 
number of looks are available. One solution to this problem is the greedy method 
where each look is assigned to the cell for which the detection probability in- 
creases the most. Let A, be the increase when x is augmented by one additional 
look in cell i. Then, since all terms in (7.7) remain constant except for the one 
where x, is increased. 



Since q^‘ is initially 1 for all ; because x is initially 0, the first look goes to the 

cell where p(J - q) is largest. If the winning cell is cell j, then A is updated by 
multiplying by q , after which the second look is allocated by again searching for a 
maximum, and so on until all looks are distributed. 

Example 3: Suppose p = (0.5, 0.4, 0.1) and q = (0.9, 0.5, 0). Cell 3 might corre- 
spond to the proverbial streetlight under which the drunk is tempted to search for 
his lost keys (q^ = 0, so detection is certain if the keys are there), but the trouble is 
that he probably did not lose them there (p,= 0.1). Table 1 shows how the first five 
looks would be allocated by the greedy algorithm, breaking ties by choosing the 
lower numbered cell. Note that cell 3 is searched, but only on the third look, and 
that most of the looks go to cell 2. The searcher is so efficient in cell 3 that one 
look suffices, and so inefficient in cell 1 that effort is better allocated to cell 2. The 
detection probability for all five looks is 0.2-I-0.1 -i-O.l -H 0.05 H- 0.05, computed by 
adding up the five selected (bold) A’s from Table 1 or alternatively by applying 
(7.7) with x = (l,3,l). Either way, the best detection probability after 5 looks is 0.5. 




(7.7) 



1=1 



A- = A?,"' 






(7.8) 
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look # 


winning cell 


A, 


A, 


A, 


1 


2 


0.05 


0.2 


0.1 


2 


2 


0.05 


0.1 


0.1 


3 


3 


0.05 


0.05 


0.1 


4 


1 


0.05 


0.05 


0 


5 


2 


0.045 


0.05 


0 



Table 1: Showing how A is updated and the winning cell selected for the first 
five looks of Example 3. The winning cell always corresponds to the largest A, 
shown bold. 

The greedy algorithm ean also be interpreted differently. One ean imagine 
that the seareher at eaeh stage applies Bayes theorem (Appendix A) to obtain the 
eurrent distribution of the target’s position, given that no deteetion has oecurred 
yet, and then treats the next look as if it were the first, but with the updated oceu- 
pation probabilities. Consider the situation after three looks in Example 3. The dis- 
tribution of the target’s position, given that the first three looks fail to deteet it, is 
(0.5, 0.1, 0)/0.6 or (5/6, 1/6, 0). Under the same eondition, the deteetion probabili- 
ties for the next look are (1/12, 1/12, 0), so the fourth look ean go in either eell 1 
or eell 2, as in Table 1 . A review of the Bayesian math will reveal that this will 
always be the ease - the looks are alloeated in the same sequenee under either in- 
terpretation. The updated oeeupation probabilities ean be obtained by just dividing 
out the (1 — 9,) faetors from A„ and then normalizing so that the probabilities sum 
to 1. When (1 — g,) is again multiplied baek in, we are making (exeept for the nor- 
malization faetor) the same eomparisons as in Table 1. The updated oeeupation 
probabilities are thus a suffieient summary of all past seareh and might be made 
the basis of a management information system for the protraeted seareh. 

The primary output of most eomputerized deeision aids for seareh is a display 
of the most reeently updated oeeupation probability distribution. A reeommenda- 
tion of what to do next might also be made, but the software is always prepared 
for the user to ignore it. The user makes his deeision, observes the result, the oe- 
eupaney display is updated to aeeount for the new observation, and the eyele re- 
peats. Sheet “Greedy” of Chapter7.xls eontains an implementation. Terms used on 
that sheet are taken from terrestrial seareh and reseue work. The (l-^,j factors are 
called PODs (POD for Probability Of Detection) and the input distribution p is 
called POAs (POA for Probability Of Area). Applying Bayes theorem is called 
“shifting POAs.” 

The Greedy algorithm described above at no point requires the user to know 
how much search effort remains. It would be equally applicable if the searcher in- 
tended to search until the target is found, whatever number of looks are required, 
in which case the algorithm would minimize the average total number of looks 
expended. The Greedy sequence of looks is optimal in practically any sense of the 
word. This kind of uniform optimality depends on the assumptions made earlier in 
this section, particularly the assumption that the target does not move. 
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7.5.2 Continuous Effort 

We might measure effort in “looks” even when fraetional looks are permitted, but 
in this seetion we will simply refer to time, sinee this is the most eommon measure 
of eontinuous effort. It is eonvenient to write the overlook probability as 
exp(-a,), where a, is some nonnegative number (the natural log of l/g„ to be ex- 
aet). In that ease, (7.7) takes the form 



1 - -Pfl = Z A exp(-a.x,. ). (7.9) 

l=I 

The problem of minimizing (7.9) now involves the eontinuous, nonnegative vari- 
ables X, subjeet to the eonstraint Zx, <t , where t is the amount of time available. 

1=1 

We ean no longer employ the greedy algorithm, sinee there is no time quantum, 
but the minimization problem is nonetheless not eomputationally diffieult. There 
are effieient proeedures tailored to the problem (Washburn, 1981), but general 
nonlinear optimizers sueh as Exeel’s Solver will have no diffieulty with it. Sheet 
“Continuous” of Chapter7.xls will do the minimization for four eells (Exereise 5). 

Equation (7.9) retains the independenee assumption of Seetion 7.5.1. That as- 
sumption is eorreet if the seareh of every eell is random; in faet, the parameter a 
for eaeh eell is simply VWIA, as in Seetion 7.3.2. If the independenee assumption 
is not eorreet, (7.9) should be replaeed by 



i-^«=Za/;(^i), (7.10) 

i=l 

where ffc,) represents the overlook probability when x, units of effort are applied 
to eell i. For example, if the seareh of eell i were aeeording to the inverse eube 
law, then/^(x,) would be an expression involving the eumulative normal distribu- 
tion. As long as/^(x,) is a deereasing, eonvex funetion for all eells, effieient teeh- 
niques for the minimization still exist (Stone, 1975). 



7.5.3 Search of a Bivariate Normal Prior 

Uneertainty about the target’s loeation often takes the form of a normal distribu- 
tion in two dimensions. A eommon reason for this is that several sensors measure 
bearings to the target. If the sensors make normally distributed errors in measuring 
bearings at long range, then the resulting position distribution of the target will be 
bivariate normal. A bivariate normal prior distribution ean be illustrated on a map 
by drawing a standard ellipse, typieally the 2-a ellipse that eontains the target with 
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probability l-exp(-2) = 0.865. Other standards are occasionally used, resulting in 
larger or smaller displayed ellipses, but any such ellipse is a surrogate for a com- 
plete stretched bell-shaped distribution that could be partitioned into cells for pur- 
poses of conducting a search. If the search is random, a standard problem results 
that has been the focus of analysis since World War II. This section summarizes 
the results. 

We suppose that the searcher moves at speed V and sweepwidth W for a time 
period t. The only important thing about these three quantities is the product, 
which can be thought of as the amount of area covered by the searcher if overlap 
is ignored. If the two standard deviations characterizing the ellipse are <3u and <3v, 
we can define a dimensionless measure of effort z = VWt l{27ra^ay ) , a kind of 
coverage ratio. This measure is also defined in Chapter 2, except that the numera- 
tor there is na, rather than VWt. The marksman of Chapter 2 is trying to kill the 
target with n shots, each of which has lethal area a, while our searcher covers area 
by moving continuously. The marksman and the searcher each face the problem of 
arranging a given total area to “cover” the target with the largest possible prob- 
ability. 

As in Chapter 2, we distinguish three cases. The exhaustive case corresponds 
to the optimistic assumption that the total area can be cut and shaped as the 
searcher wishes, without overlap. It leads to the upper bound (2.18). The other two 
cases both assume that the search is random, but differ in how the effort is ar- 
ranged. If tactical simplicity demands that the search effort be distributed uni- 
formly over some region, then the best detection probability is given by the SULR 
formula (2.20), with the optimal region being the ellipse recorded in Chapter 2. If 
the random search effort can actually be distributed optimally (this is not a contra- 
diction in terms - think of strewing confetti non-uniformly), then the best detec- 
tion probability is given by the SOLR formula (2.23). A SOLR search might be 
implemented by making repeated passes over progressively larger regions. As 
Figure 4 of Chapter 2 makes clear; however, there is little difference between the 
SULR and SOLR cases. 

There are cases intermediate between exhaustive and random search, as in the 
next example. 

Example 4. Suppose that <5u= 20, Cv= 10, V= 100, 2, and t = 8, so z = 1.27. 

We assume that search must be uniform within some region. Using (2.20), the 
SULR Po is 0.46. Flowever, suppose that random search is felt to be too pessimis- 
tic an assumption, and that the inverse cube law is expected to apply instead. The 
idea is to perform a search with uniform track spacing inside some elliptical re- 
gion, so (7.5) should apply, but what region should be searched? Since no formula 
for the best region is available, we can only try trial and error. Sheet “BVN” of 
Chapter7.xls finds the best x by exhaustion, where x is the area of the elliptical 
search region divided by the normalizing constant InOuOv- The best x turns out to 
be 1.2. From the definition of x, the best ellipse to search should have radii of 
standard deviations in each direction. The two diameters of that ellipse are 
thus 62 and 31. The uniform track spacing within this ellipse should be R7z or 
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1.57. The expected detection probability is 0.57, significantly higher than the 
SULR value. 



7.6 Moving Targets 



If the search process is extended in time, as is often the case, then it may be neces- 
sary to allow for motion of the target as the search proceeds. A lost person, for ex- 
ample, can move a significant distance during the daylight hours when searches 
are usually conducted. If the distance moved during the search time is small com- 
pared to the cell dimension, we might proceed as if the target were stationary. In 
general, however, some method of allowing for target motion is needed. 

The significance of target motion depends strongly on what motivates the mo- 
tion. The target might be cooperative, as when a lost person moves toward a res- 
cuer. In combat situations, however, the target is more likely to move away from 
the searcher, thus frustrating detection. In fact, one reason for not using active sen- 
sors is that the noisy pings will warn the target of the searcher’s position, thus 
enabling avoidance. 

Both cooperative and noncooperative behavior depend on the target’s early 
detection of the searcher. It is also possible that the target does not have such in- 
formation or is moving around with no goal in mind that has anything to do with 
the possibility of being detected, a type of motion we might call “benign.” It is not 
clear whether benign motion is good or bad for the searcher - an argument can be 
constructed either way. The target’s motion can cause it to find the searcher, as 
any spider knows. On the other hand, the randomness of target motion is one of 
the things that can spoil an attempted exhaustive search. Aspects of each argument 
can be seen in the sections that follow. 



7.6.1 Dynamic Enhancement 



If the target moves “at random” within the search area at speed U, then it is rea- 
sonable to suppose that the angle 0 between the target’s velocity and the 
searcher’s velocity is equally likely to be anything between 0 and 2%. If the 
searcher’s speed is V, then the relative speed between the two is 

+ - 2UV cos(^) . The average relative speed is therefore 



iK 

V= j +V^ -2UVcos(0) 

0 



27T 



(7.11) 




152 



7 Search 



The quantity V is sometimes taken to be a “dynamically enhanced” search speed 
in the sense that a searcher with speed V will have the same prospects of finding a 
stationary target as would a searcher with speed V looking for a target moving 
at speed U. Its use in that sense goes back to at least World War II (OEG, 1946, 
Section 1.5). 

Equation (7.1 1) is a symmetric function of U and V; it makes no difference 
whether the searcher moves at 10 knots while the target moves at 2 knots, or vice 
versa. The enhanced searcher speed is the same (10.1 knots, to be precise) in ei- 
ther case. This observation might cause a target who does not wish to be found to 
wonder whether movement is actually a good idea. 

Although there is no closed form for the integral in (7.1 1) (it is a complete el- 
liptic integral of the second kind), we can at least give some upper and lower 
bounds on V : 

max(U,V)<V<^Ju^+V^ . (7.12) 

The bounds are closest to each other when U and V differ significantly. The worst 
case is when U and V are equal (to 1, say), in which case the lower bound is 1, the 
upper bound is 1.41, and V is actually 4/;r= 1.27. Figure 5 shows a graph. See 
sheet “EquivSpd” of Chapter7.xls for similar graphs or to see how to use the 
VBA function EquivSpd(C/,F), which will calculate the dynamically enhanced 
speed for any arguments. 

Example 5. Suppose that a searcher with speed 40 knots is looking for a target 
with speed 30 knots in a region with area 100 square nautical miles. If the sweep- 
width is 0.5 nautical mile, how long will it take for detection? To answer this we 
first find the dynamically enhanced speed, which turns out to be 63.6 knots. Since 
the target is moving around randomly, we assume that only random search is pos- 
sible, and appeal to Section 7.3.2. According to that section, the mean time to de- 
tection should be, on the average, 100/(0. 5x63. 6) = 3.1 hours. 
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V~ 

Lower Bnd 

Upper Bnd 



Figure 5: Showing the dynamically enhanced equivalent search speed (F~) 
when the target speed is 30, along with upper and lower hounds. 



Although the dynamic enhancement formula has a long history, the authors 
are not aware of any serious ahempt to validate it. Any such attempt would have 
to be specific about what “random motion” means for both searcher and target. If 
the target is motivated to avoid detection, then it may very well move slowly even 
though it is capable of going fast, since significant motion is basically a bad idea 
for someone who wants to hide. In search games where the target wants to delay 
detection as long as possible, the target’s best motion is known to be almost null, 
moving just enough to foil any ahempt at exhaustive search (Gal, 1980, Chapter 4). 
If the target of Example 5 moved in that way, the “enhanced” speed would be 40 
knots, not 63.6. 



7.6.2 Markov Motion 

Rather than trying to reduce the situation to an equivalent one involving a sta- 
tionary target, we might try to extend the theory of Section 7.5 to include 
moving targets. The simplest concept is to imagine that searcher and target 
take turns acting. First, the searcher distributes some effort over the cells. If 
search is unsuccessful, the target may move to a different cell, after which the 
searcher tries again, and so on. It is not usually realistic to suppose that the 
target’s next position is independent of its current position, so we suppose in- 
stead that the target’s motion is a Markov chain (Appendix A). This requires 
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that we specify the chain’s transition function, in addition to the probability 
distribution of the target’s initial position. 

Definition: For a Markov chain defined on a set of cells C, the transition function 
T{x,y,i) is the probability that a target in cell x at time t will move to cell y at time 
t+ \. Such a function must be nonnegative, summing on y to 1 for each xg C, and 
also for each time t. 

A commonly used type of Markov chain is a random walk where the target ei- 
ther moves to a neighboring cell with some specified probability or else remains 
where it is. The general effect of this is that the distribution of position spreads out 
over more and more cells as time goes by. Sheet “RandWalk” of Chapter7.xls il- 
lustrates this for C = {1,. . .,17}. That sheet also illustrates the effect of simultaneous 
searching, but the effect of motion alone can be explored by simply setting the 
overlook probability to 1 . 

Example 6: The set of cells is C= {1,2,3}, with the initial distribution of target 
position being p = (0.51, 0.49, 0). After each search, the target increases its cell in- 
dex by 1 or else does not move if it is already in cell three. In terms of the transi- 
tion function, we have T(l,2, t) = T(2,3, t) = T(3,3, t) = \, for all t, with all other 
transition probabilities being 0. The overlook probabilities are assumed to be 
(0,0,1) in the three cells; that is, the target cannot be detected once it gets to cell 3. 
The overlook probabilities are also 1 if the searcher looks in any cell that does not 
contain the target. There are two opportunities to look, and the question is where 
those two looks should be placed. After exhausting all the possibilities, we con- 
clude that both looks should be in cell 2, since this will guarantee detecting the 
target. If the target is initially located in cell 2, it will be detected on the first look, 
or on the second look if it is initially located in cell 1. 

Let P{x,t) be the probability that the target is located in cell x at time t and is 
not detected by any of the searches before time t. If normalized to sum on x to 1, 
this function is the one that a searcher wondering where to look at time t would 
call “the current distribution of the target’s location” and might wish to see dis- 
played to guide his search at time t. Let q{x,t) be the nondetection probability for a 
look at time t, given that the target is in cell x. This fimction is determined by the 
searcher at time t when he decides which cell to look in or possibly how to spread 
his effort over multiple cells. The formula that advances time is then given by the 
theorem of total probability: 

P{y , ^ + 1) = X P(.^, t)q{x, 0 r(x, y, t); ye C. (7.13) 

A-eC 



The only way to get to cell y at time t+ without being detected before t+\, is 
to be in some cell x at time t, without being detected before t, to not be detected at 
time t, and to move from x to y. That statement is formalized by (7.13). Although 
Example 6 is simple enough not to require it, (7.13) could be used to reach the 
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same conclusions about the effect of searching twice in cell 2 (Exercise 7). For- 
mula (7.13) is used on sheet “RandWalk” of Chapter7.xls to update the distribu- 
tion of the target’s position, accounting for motion as well as the effects of search. 

In this paragraph we use the dot (•) notation to emphasize that we are refer- 
ring to all cells at the stated time, rather than any specific cell. Formula (7.13) 
might be used in the following way: Examine the initial distribution P(«, 1) and 
decide on what q{», 1) should be; that is, decide how to search at time 1. Then use 
(7.13) to compute P{», 2) and inspect that function before detemiining how to 
search at time 2, thus determining q(», 2), and so on. In selecting q(», t) at each 
time, we might choose the search plan that minimizes the immediate nondetection 
probability, which is given by (7.13) at time t with the transition function factor 
omitted. Such a search is myopically optimal in the sense that it maxi- 
mizes the immediate detection probability at each time, given the failure of all 
previous searches. Finding a myopically optimal search is computationally easy 
because it amounts to solving a succession of static search problems, one at a time. 

A myopic search will usually come close to maximizing the overall detection 
probability, but can make mistakes. Applied to Example 6, it would first search 
cell 1 and then not know what to do for the second look, since all three alterna- 
tives result in 0 for the immediate detection probability on the second look (see 
Exercise 7). The overall detection probability after two looks would be 0.51, the 
same as the detection probability after one look. There is a better way to utilize 
two looks, as explained in Example 6. In fact, that example has been deliberately 
designed to make myopic search look nearly as bad as possible. Myopia is usually 
much closer to being optimal than in this example, but the fact is that the global 
optimality property that we found in Section 7.5 disappears when the target 
moves. 

There is a technique - the FAB (Forward And Backward) algorithm - designed 
to find the globally optimal distribution of effort when the target moves. The main 
idea is to introduce into (7.13) another factor that represents the probability of 
nondetection in the future, so that the sum is the nondetection probability over all 
time. The FAB algorithm is iterative and may converge to an optimal solution 
only in the limit. Nonetheless, the speed of modem computers makes applications 
feasible. See Brown (1980) or Washburn (2002) for a complete description of the 
algorithm. 

It should be said that the turn-taking models considered in this section ignore 
the possibility that the target might be detected in one of those turns where the tar- 
get’s location changes, and as such are probably unreliable models for targets that 
move a lot. Recall that the dynamically enhanced search speed of Section 7.6.1 is 
minimized when the target is stationary, an indication that targets who do not want 
to be found should not move much. In contrast, if this section’s reluctant target were 
given his choice of Markov transition functions, his favorite function in many cases 
would make his next location equally likely to be in any cell, which implies a lot 
of motion. These are opposite trends, and the contrast ought to be worrisome. In 
practice, the models of this section are probably searcher-pessimistic, especially for 
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targets that move a lot. Unfortunately, there seem to have been no studies of ex- 
actly what “a lot” means. 



7.6.3 Evasive Targets 

As mentioned earlier, the success of a search operation for a moving target de- 
pends on the target’s motivation. Three clear cases can be distinguished: 

• The target may desire detection. There is a growing literature on this 
topic. It is important to the search and rescue community, but we 
will not delve into it here. 

• The target may be indifferent to detection or unaware that it is being 
searched for. This is certainly the case for inanimate objects and is 
sometimes true for fish, humans, and submarines. Sections 7.6.1 and 
7.6.2 apply to the motion of such objects. 

• The target may know that it is being searched for and take measures 
to avoid being found. This is often the appropriate assumption in 
combat models. This kind of evasive target is the current subject. 

Evasive targets that lack information about the searcher’s activities may or 
may not be able to effectively use their capability for movement. Consider the 
“Princess and Monster” game, which starts when two players are randomly lo- 
cated in the unit disk. The Monster moves at speed 1 until it comes within a given 
small capture distance r of the Princess, after which the game ends. The Princess 
can also move, but she does so blindly because she cannot detect the Monster. 
Eventual detection by the Monster is inevitable, but the Princess would still like to 
delay it as much as possible. The solution of this difficult game is known (Lalley 
and Robbins, 1987). An optimal tactic for the Monster is diffuse reflection, as de- 
fined in Section 7.3.2. The Princess’ optimal tactic has her moving, but just 
enough to foil any attempt on the part of the Monster to search exhaustively. The 
net effect of all this is that the Monster in effect performs a random search of the 
disk. The Princess’ capability for motion is of very little use to her - she cannot do 
things like “go the other way” because she does not know the Monster’s location. 
All she can do is change a potentially exhaustive search into a random one. 

One modification of the Princess and Monster game has her knowing the di- 
rection to the Monster. This modification is sometimes militarily realistic, since 
passive interception of active radar or sonar signals reveals precisely that kind of 
information, usually at long ranges compared to the searcher’s detection radius. 
The Princess will still be captured, but it will take a lot longer. In a similar game 
played in a rectangle, Washburn (2002, Chapter 2) finds that giving the Princess a 
speed of only 20% of the searcher’s speed results in increasing the mean time to 
detection by about 40% over what it would be if the Princess had no directional 
information. This kind of result is part of conventional military wisdom - it is 
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widely recognized that the use of active sensors in the pursuit of an evasive target, 
while sometimes necessary, gives away important information. Washburn’s results 
were obtained experimentally by having officer-students at the Naval Postgradu- 
ate School play the game repeatedly. There is no known analytic solution of this 
game. 

Another famous problem with an evasive target is the Flaming Datum prob- 
lem. The name comes from a situation where a submarine has just torpedoed a 
ship, thereby revealing its own position to pursuing ships or aircraft. The burning 
ship marks a place where the submarine once was, hence the name. An abstract 
version with a pursuer and evader has five parameters: 

• U is the evader’s maximum speed (we assume unlimited endurance) 

• Vis the pursuer’s speed 

• Wis the pursuer’s sweepwidth 

• X is the time after the initiating event when the pursuer arrives at the 
datum, the “time late” 

• tis the amount of time that the pursuer spends searching 

The exact solution of this problem as a game is again unknown, but, if we assume 
that the pursuer at all times searches randomly within the gradually expanding cir- 
cle that represents the evader’s farthest distance from the datum, the detection 
probability can be shown to be (Washburn, 2002, Chapter 2) 

VW 1 1 

P„(0 = l-exp(- — ( )) (7.14) 

7TU T T+t 

In this case, results are very sensitive to the evader’s speed, which is squared in 
(7.14). The evader can use his speed effectively to become more and more lost in 
the two-dimensional plane, whereas the Princess, no matter what her speed, is not 
allowed to leave the unit disk. Generalizations of (7.14) are considered by 
Hohzaki and Washburn (2001). 

Example 7: Suppose (U, V, W, x) = (20 kt, 100 kt, 4 nm, 1 h). After searching 
for 4 h, the detection probability is Pb( 4) = 0.225. Even in the limit as t ap- 
proaches infinity, the detection probability is only 0.273. On the other hand, if U is 
15 kt instead of 20 kt, the detection probability after 4 h rises to 0.364. Given the 
choice of infinite endurance or of dealing with a slower evader, the pursuer would 
far prefer the slower evader. See Exercise 10. 

As was mentioned above, neither the modified Princess and Monster game 
nor the Flaming Datum problem have been rigorously solved as two-person zero- 
sum games. This regrettable situation can also be expected in other situations 
where an evasive target can move as the game proceeds, especially if the target 
benefits from information about the searcher. As analytic problems, most of them 
are too complicated to solve exactly. This is unfortunate, since game theory is the 




158 



7 Search 



natural perspective. The analytic problems simplify if the evasive target cannot 
move once it chooses its initial location. Chapter 9 includes an example. 



7.7 Further Reading 



Most detection devices, including human eyes and ears, call “targef’ only if the 
received signal is stronger than some threshold. Since signals can exceed a thresh- 
old even when no target is present, there is a seeming necessity to consider false 
alarms along with detections. The trade-off between the two can be summarized in 
a receiver operating characteristic (ROC) curve that shows the false alarm prob- 
ability that is implied by any given detection probability. The impact of a false 
alarm ranges from the time delay required to determine that the contact is not truly 
a target to the 290 lives lost when the USS Vincennes mistakenly shot down an 
Iranian airliner in 1988, thinking it was an attacking warplane. The omission of all 
discussion of such things in the sections above is typical of the search theory lit- 
erature, but there are a few exceptions. Hohzaki (2007) considers a search game 
where false alarms consume the searcher’s limited time. Also see Stone (1975, 
Chapter 6) and Washburn (2002, Chapter 10). Another important effect of false 
alarms is their consumption of munitions. In this book, the UCAV models of 
Chapter 9 include this effect, which can be important. 

While search games such as the Princess and Monster game are diffrcult 
mathematical objects, there is much to be learned from their study. The excellent 
search survey by Benkoski et al. (1991) includes several references to the subject, 
as well as additional papers involving false alarms. 

In practice, searching for a target should be halted if the target is not found af- 
ter a certain time limit, even if continuation is feasible. In search and rescue opera- 
tions, this issue is sometimes handled by initially including an unsearchable cell 
called ROW (Rest Of World). As other cells are searched unsuccessfully, Bayes 
theorem will gradually move probability into ROW, and search stops when the 
ROW probability becomes so high that future success is unlikely. Of course, 
“high” is a relative term, and there is always a trade-off between the value of find- 
ing the target and the cost of continuing the search. See Stone (1975, Chapter 5) 
for relevant analytic optimization methods. 

The sections above implicitly assume that the object of search is a single tar- 
get. As long as all targets are statistically identical and independent, much remains 
the same if the number of targets is unknown or random. The detection probability 
after a fixed time is the same for all targets, for example. However, one does have 
to distinguish whether sampling is with or without replacement; that is, can the 
searcher remember whether a found target has been found before? If the searcher 
can remember and knows how many targets are present, then one possible goal of 
search is to find all of the targets. These issues arise in the UCAV part of Chapter 9. 
Many other interesting extensions can be found in books devoted entirely to 
search, for example. Stone (1975) or Washburn (2002). 
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Exercises 

(1) Use (7.2) and (7.3) to prove that the mean time to deteetion in an exhaustive 
seareh is A/{2VW). 

(2) In Seetion 7.3.1, a elaim is made that the time to deteetion laeks memory. 
Prove that elaim by evaluating the eonditional probability that it is based on. 
Ans. The eonditional probability should be exp(-Xjr), regardless of t. 

(3) Formula (7.4) involves a limit for large n. Demonstrate that the limit holds 
by making a spreadsheet that has a eolumn for n = 1,2,3... . The deteetion 
probability should be equivalent to (7.3) when n = 1 or to (7.4) when n is 
large. For simplieity, set VW/A = 1 . 

(4) Suppose that five searehers are available to eonstruet a barrier aeross a ehan- 
nel with width 20 miles, and that eaeh seareher has a definite deteetion range 
of 1 mile. Also suppose that V. 

(a) If the five searehers are able to patrol eoherently in sueh a manner that 
they are jointly equivalent to one seareher with deteetion radius 5 miles, 
what will be the deteetion probability? Approximate it using (7.6). 

(b) Same as above exeept that the searehers patrol independently at different 
plaees in the ehannel, so that eaeh has an independent ehanee of deteeting 
any transitors. 

Ans. 0.71 in (a) and (using (7.6) again, but powering up the result for one 
seareher) 0.53 in (b). One big ehanee is better than five little ones. 

(5) Use sheet “Continuous” of Chapter7.xls to solve the following four-eell 
problem, in eaeh eell of whieh a random seareh is eondueted. The table be- 
low gives the data for the four eells, with /?, being the probability that the tar- 
get is loeated in the eell. A total of 8 h are available for searehing. 



Cell number 


Pi 


Vj (speed) 


lF,(sweepwidth) 


^,(eell area) 


1 


0.1 


50 


10 


5000 


2 


0.2 


100 


10 


5000 


3 


0.3 


200 


10 


8000 


4 


0.4 


50 


20 


4000 



Ans: First ealeulate a=(0.1, 0.2, 0.25, 0.25). After minimizing (7.9), the op- 
timal X is (0, 0.7, 3.1, 4.2), with deteetion probability 0.45. Given the limited 
amount of time available, it is best to gamble that the target is not in eell 1, 
spreading the effort unevenly over the other three eells. 

(6) Example 4 utilizes sheet “BVN” of Chapter7.xls to solve a problem involv- 
ing inverse eube law seareh. It is not neeessary to use that brute foree teeh- 
nique to solve the eorresponding random seareh problem, sinee formula 
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(2.20) gives the best detection probability analytically, but do it anyway. 
Modify the sheet so that random search replaces inverse cube law search, 
thus verifying that (2.20) is correct. In Example 4, the best ellipse to search 
had diameters of 62 and 31. What are the best diameters if search is random? 
Ans: The best detection probability with random search should be 0.46. 
Correct to the nearest tenth, the best v is 1.1, which corresponds to an ellipse 
that is 30 by 59. 

(7) Show that the myopic solution to Example 6 is as stated in Section 7.6.2. As- 
sign the first look to cell 1, since P(l,l) = 0.51 is the largest of the three 
probabilities. This determines q{», 1), and therefore P(«, 2). Assign the sec- 
ond look using the same principle, compute P(«, 3), and then sum P(«, 3) to 
obtain the probability that the target survives the first two looks. The transi- 
tion function at time 1 is given in Example 6, and the transition function at 
time 2 is the same. 

Ans. You should find that P(«, 3) = (0, 0, 0.49), which sums to 0.49. 

(8) The same as Exercise 7, but this time search cell 2 first in spite of the fact 
that doing so does not maximize the immediate detection probability. Then 
search myopically after that. You should find that P(«, 3)=(0,0,0). 

(9) Sheet “RandWalk” of Chapter7.xls illustrates the effect of repeatedly search- 
ing cell 6 while a target moves according to a random walk over 17 cells. 

a. Experiment with the sheet. If the overlook probability is 1, the target 
should always survive and the distribution of position at time 30 should 
be nearly a bell-shaped normal distribution (the central limit theorem is 
operating). If the overlook probability is 0, cell 6 should act as a barrier 
that prevents any probability from getting into lower numbered cells 
from higher numbered cells. Predict what would happen if the diffu- 
sion probability were 0 and confirm your prediction using the sheet. 

b. Study the formulas in the columns for cells 5, 6, and 7. They all in- 
volve the overlook probability, whereas cells in other columns do not. 
Once you understand the formulas, modify the columns for cells 11, 
12, and 13 to reflect a search with the input overlook probability in cell 
12 at every time. Have you done this correctly? A verification test 
would be to input a symmetric initial distribution. If the distribution at 
time 30 is not symmetric, there is an error somewhere. 

(10) Sheet “FlamDat” of Chapter7.xls includes a partially implemented graph of 
detection probability versus time according to formula 7.14. Complete the 
sheet by providing the missing formula and verify that the claims made in 
Example 7 are correct. Hint: If you write the formula with correct absolute 
and relative references, you will only need to type it once. In ExceF'^, the 
number ;ris Pi(), a built-in function with no arguments. 




Chapter 8 
Mine Warfare 



The US Navy has lost control of the 
sea to a nation without a Navy, using 
pre-World War I weapons laid by 
vessels that were utilized at the time 
of the birth of Christ. 

RAD M Alan Smith (1950) 



8.1 Introduction 



In 1950 during the Korean War, a minefield in Wonsan harbor inspired the open- 
ing quote by Admiral Smith. That minefield delayed the planned landing by over a 
week, while 250 ships steamed back and forth outside the harbor. The US Navy 
lost four minesweepers in the process of clearing it, and several other ships were 
also sunk or damaged. Naval mines were first used effectively in the Russo- 
Japanese war of 1904, where they were decisive in spite of being primitive contact 
mines. With improved sensors, naval mines have been used effectively in every 
significant naval conflict since then (Hartmann, 1979). Most of the damage done 
to the US Navy since World War II has been due to mines (National Research 
Council, 2001). 

Mines are also becoming increasingly threatening on land. Earth’s mid- 
latitudes are populated by an estimated 100 million antipersonnel land mines (Fig- 
ure 1) left over from various wars. Thousands of innocents have been killed or in- 
jured by these mines, to the point where a group of nations has agreed in the Ot- 
tawa treaty of 1997 to cease stockpiling and using antipersonnel land mines 
(Keeley, 2003). This group of nations does not include the major mine producers, 
however, so continued use of land mines can be expected. The reluctance of many 
nations to abandon use of such mines is directly related to their low cost and high 
effectiveness. Very low cost land mines in the form of improvised explosive de- 
vices (lEDs) have proved to be effective against road traffic. 

What is a mine? The distinguishing features are that a mine is stationary once 
laid, that its location is concealed, and that it destroys itself in the process of at- 
tacking its enemy. A spider fails to be a mine only because it lacks the latter prop- 
erty. Like spiders, mines rely for their effectiveness on the enemy’s need to move. 
Since mines do not themselves move, and since they do not need to reload after 
detonation, mines can be simple and inexpensive. Since they are inexpensive, they 
can achieve effectiveness through replication. We generally analyze minefields, 
rather than individual mines. 



A. Washburn, M. Kress, Combat Modeling, International Series in Operations 161 

Research & Management Science 134, DOI 10.1007/978-l-4419-0790-5_8, 

© Springer Science+Business Media, LLC 2009 




162 



8 Mine Warfare 




Figure 1: A landmine being cleared by band. To prevent clearance, some 
landmines are built to detonate if they are even tilted. 

Computer models have been employed for deeades in analyzing minefields, 
both for purposes of planning minefields and for elearing them. Mine warfare is a 
game of measure and eountermeasure in the faee of uneertainty, so some of 
these models are rather eomplex, but we begin with some relatively simple ones in 
Section 8.2. Sections 8.3 and 8.4 deal with minefield planning and minefield clear- 
ing, respectively. Since planning is often conducted in the expectation of clear- 
ance, models based on game theory are applicable. Section 8.5 considers some of 
the possibilities. 



8.2 Simple Minefield Models 



Large-scale models of warfare are generally not built to study the details of mine- 
field construction and countermeasures, but still need to represent mine warfare in 
some simple manner. The object is to retain the essence of mine warfare without 
including too many details, databases, or megaflops. 

The simplest model would be to simply declare a certain region out of bounds 
“because there are mines in there,” perhaps enforced by the rule that anyone who 
enters the area is automatically killed. While this model gets the idea across, it se- 
riously overstates minefield effectiveness. An individual mine generally controls 
so little area that most real minefields are more likely to let an intruder pass safely 
than to kill him. In addition, the effectiveness of a real minefield will generally 
decrease with time because the intruder is usually able to prevent replenishment. 
Thus, even though the simple “permission” model does have its uses and charms, 
it will not be useful for minefield planning or clearance. 
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A slightly more complex model, but still a simple one, would be to imagine 
that N mines are scattered independently at random within a region of area A, and 
that each of these mines has a fixed lethal radius R. Any intruder who comes 
within 7? of a mine will be killed and will thereby be rendered incapable (we as- 
sume) of actuating any further mines. Suppose an intrader moves a distance D 
within the minefield. This movement defines an area of size 2RD that must not 
contain a mine if the intruder is to survive. The probability that any particular 
mine lies within this area is t = 2RDIA; t is the “threaf’ per mine. If there are N 
mines located independently in the area, the threat from the minefield itself is T = 
1 - (1 - t)^. This is an example of “powering up,” as introduced in Chapter 2. 
With probability T, the intruder is sunk and N decreases by one. Otherwise, the in- 
truder survives and N does not change. The number of mines cannot decrease by 
more than one because a sunk intruder is assumed to actuate no further mines. We 
will call this model the ENWGS model because it is used in the ENhanced Naval 
Wargaming System (Wagner et al., 1999). 

The ENWGS model fixes the defects of the permission model without becom- 
ing overly complicated. If realistic numbers are substituted for t and N, the mine- 
field threat T will be realistically small. Barring replenishment with more mines, 
the threat will also decrease with time as N decreases. 

A slight generalization of the ENWGS formula has been used at times in 
minefield planning, where the crucial question is how many mines are required to 
make the initial threat of the minefield be sufficiently large. The generalization is 
to replace 2R by W, the sweepwidth of the mine for intruders (see Section 7.2). 
Suppose that the minefield is a rectangle with width B, assumed large compared to 
W, and that each intruder transits the length L of the minefield. Then t^WIB, and 
we have the formula for simple initial threat: 

SIT = l-(l-W/Bf =l-(l-WL/ Af. (8.1) 

Here N is the initial number of mines and the minefield area A is just BL. Using 
(8.1) we could, for example, compute the number of mines required to make SIT 
be at least (say) 0.1 (Exercises 1 and 2). 

The ENWGS model might be described as a Markov chain (Appendix A) 
where the state is the number of mines remaining and a transition corresponds to a 
penetration attempt. With each transition, the state either stays the same (success- 
ful penetration) or decreases by one (casualty). The probability of the latter is 
given by (8.1). 



8.3 The Uncountered Minefield Planning Model (UMPM) 



The main questions in minefield planning concern the number and type of mines 
to be employed. There is something to be said for employing a variety of mine 
types, particularly in situations where minesweeping may precede intrusion, but in 
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this section we consider minefields consisting of only one type of mine. For any 
specific type, the main problem is to determine the number that is needed. 

Although SIT is relevant, minefield planners often consider other measures 
that involve intruders after the first. The “threat profile” is simply a graph of the 
threat (probability of not making it through the minefield) to intruders 1, 2, 3,. . . in 
sequence, with 5/T being the first of those numbers. The threat profile is usually a 
decreasing function as long as the minefield cannot be replenished. One can also 
imagine a group of intruders attempting penetration, and consider X, the number 
that is lost in the process. The probability mass function of X can be graphed as 
the “casualty distribution,” and summarized by the expected value E{X). The 
ENWGS model is capable of predicting either the casualty distribution or the 
threat profile (Exercise 3). 

While it is an improvement over a simple permission model, the ENWGS 
model still has some features that make it unattractive as a minefield planning 
tool. The chief of these is that it does not deal realistically with channelization. 
Minefields are usually planned in considerable uncertainty about where intruders 
will travel, so minefields tend to be much wider than the sweepwidth of an indi- 
vidual mine. A reasonable countermeasure on the part of the intruders is to chan- 
nelize, by which we mean that all intruders try to follow each other through the 
minefield. In this way only a comparatively narrow channel through the minefield 
is ever tested, and mines outside this channel are automatically rendered ineffec- 
tive through lack of opportunity. This tactic does not affect the threat to the first 
intruder, but the ENWGS model also predicts the threat to intruders after the first. 

In extreme cases, ENWGS predictions can be seriously wrong. For example, 
suppose that we have cookie-cutter mines and intruders that can perfectly follow 
each other. If the first intruder makes it safely through the minefield, then he 
leaves a clear channel behind him that is guaranteed not to have any mines in it, so 
all intruders after the first will also make it through the minefield safely if they 
follow him. The threat to the second intrader will be zero if the first makes it 
through safely. The ENWGS model, on the other hand, predicts that the threat to 
the second intruder is still SIT if the first makes it through safely, since N is not 
changed by the first intruder. Clearly, if our minefield model is to give realistic es- 
timates of the threat profile in the face of channelization, then ENWGS must be 
modified. 

The problem with the ENWGS model is that it makes an independence as- 
sumption that is falsified by channelization, an example of the sometimes perni- 
cious effects of the Universal Independence modeling habit described in Chapter 1 . 
For mines to act independently, either the intruders must be in independent locations 
relative to the mines or the mines must move around between intruders. The first 
possibility is false because of channelization, and the second is false because re- 
maining stationary is a fundamental mine property. In the US Navy’s Uncountered 
Minefield Planning Model (UMPM), the independence assumption is replaced by 
one of conditional independence. Specifically, in UMPM we assume that the ef- 
fects of a given mine on a succession of intruders are independent given the loca- 
tion of the mine relative to the channel centerline. 
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Let X be the distance between a mine and the channel centerline, and let A(x) 
be the probability that the mine detonates when an intruder follows the centerline 
through the minefield (we are ignoring the possibility of navigation errors). In the 
temiinology of Chapter 7, x is a lateral range and^(x) is a lateral range curve, but 
we will call A(x) an actuation curve in the present application. The area under the 
actuation curve is the sweepwidth W. Figure 2 shows half of a typical actuation 
curve. The important point to note is that, from the viewpoint of a minefield plan- 
ner, actuation curves are regrettably “sloppy” in the sense that results are hard to 
predict compared to a cookie-cutter curve. Imagine that the mine’s lethal radius is 
40. Sometimes mines will detonate at distances larger than 40 (wasted fires), and 
sometimes mines will fail to detonate at distances smaller than 40 (“wasted oppor- 
tunities”). The ENWGS model acknowledges neither kind of waste, but the 
UMPM model deals with both by using the entire actuation curve in making its 
calculations, rather than just the sweepwidth W. 




Figure 2. Actuation probability is plotted versus positive lateral ranges. Tbe 
left-band side of tbe actuation curve is symmetric. 



Given our conditional independence assumption, the probability that one of n 
intruders who follow the same channel centerline will detonate a mine at lateral 
range x can be obtained by powering up the actuation probability: 

R^ix) = l-{l-Aix)y. (8.2) 

A minefield planner will also be uncertain about the effects of a detonation, 
but less so than with actuation. For simplicity, suppose that there is some definite 
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range R such that a mine will be lethal if and only if the lateral range is smaller 
than R when the mine detonates. If the minefield has width B, and if the channel 
is perpendicular to this dimension, and if the mine is located at random in the 
minefield, then the probability that one of n intruders is actually a victim of the 
mine is 



R 

R„* = (l/B)j R„(x)dx;n>0. 

-R 



(8.3) 



In (8.3), the reason for dividing by B is that x is assumed to be uniformly distrib- 
uted over the width of the minefield. The formula is written as if the channel cen- 
terline were the center of the minefield, but is actually valid for any centerline as 
long as either 7? or IT is small compared to B. The first step in UMPM is to com- 
pute the probabilities R„* using (8.3), for n ranging from 1 up to S, the maximum 
number of transitors expected. 

Once the probabilities R„* are known, we can generate threat profiles and 
casualty distributions. Suppose a group of S intruders attempts to penetrate the 
minefield. With no loss of generality, imagine that the group all attempt to pene- 
trate simultaneously, so that the number of intruders remaining in the group can 
serve as the state of a Markov chain. If the group has n intruders remaining when 
it encounters a mine, it loses a member with probability R„*. It is not possible to 
lose more than one member because one mine cannot kill multiple intruders, so 
the only other possibility is that the number of intruders does not change. If there 
are initially S intruders, then the transition matrix is an (5-1-1) x (5-1-1) matrix with 
at most two positive probabilities in each row. For example, if 5 = 3, then 



1 0 0 

R,* 0 

0 l-i?2* 

0 0 



0 

0 

0 

1 - 7 ? 3 * 



(8.4) 



The last row of P”, the m-th power of the matrix P, will show the probabili- 
ties of being in the various states after the group of intruders passes m mines. This 
row is effectively the casualty distribution. The threat profile can also be obtained 
from the same matrix, as will be illustrated by example. 

Example 1: Suppose that R„* = 0.5, 0.6, and 0.8 for n = 1,2,3, and that the mine- 
field initially contains four mines. What is the casualty distribution if three intrud- 
ers attempt to penetrate the minefield? We first raise the transition matrix to the 
fourth power to obtain 
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p4 



1 0 0 
0.9375 0.0625 0 

0.7530 0.2214 0.0256 
0.5040 0.3894 0.0960 



0 

0 

0 

0.0016 



The last row shows the probability that 0, 1, 2, or all 3 intruders remain alive, 
which is the desired casualty distribution in reverse order - the probability of no 
casualties out of three is 0.0016 and the probability that all three intruders are 
killed is 0.5040. Other information is obtainable with a little more effort. The av- 
erage number of casualties out of three intruders (call it ps) can be found by 
weighting the probabilities, summing, and subtracting from 3. In this case we find 
= 2.4048. The second and third rows are essentially the casualty distributions if 
only 1 or 2 intruders enter initially, and can be similarly used to find jJ.\ = 0.9375 
and lli^ 1.7274. The threat to the ith intruder is in general = jJ.^ - for i > 1. 
In this case we find t\ = 0.9375, U = 0.7899, and U = 0.6774. 

The main computational tasks in UMPM are the initial computation of 
using (8.3), and the subsequent raising of P to the appropriate power. Once that is 
done, the casualty distribution, the threat profile, and the mean numbers of casual- 
ties are all available with a little more arithmetic. Sheet “UMPM” of Chapter8.xls 
is an implementation for a particular class of actuation curves of the form 
A{x) = AQxp{-{C ! x)’^) , all of which look more or less like Figure 2. The pa- 
rameter A is the maximum possible actuation probability, C is a scale parameter, 
and Z) is a shape parameter. When D is small, the actuation curve is sloppy. As D 
becomes large, the actuation probability drops suddenly from ^ to 0 as the lateral 
range increases through C, with the cookie-cutter curve being the limiting version 
for large D. The only other inputs on the UMPM sheet are the lethal range R, the 
minefield width B, and the number of mines. The maximum number of intruders is 
always taken to be 5= 10. The probabilities are computed directly on the 
spreadsheet by numerical integration. Pressing the command button initiates the 
Markov calculations that ultimately result in a graphically displayed casualty dis- 
tribution and threat profile. 

The UMPM model actually employed by the US Navy differs from sheet 
“UMPM” in handling actual actuation curves, indefinite damage distances, and in- 
tmder navigation errors, but is otherwise similar. Odle (1977) provides the under- 
lying mathematics. 

The reader may already have observed that the “uncountered” part of 
UMPM’s name is not exactly appropriate, since the model correctly reflects the 
results of the assumed channelization countermeasure. Another countermeasure to 
a minefield is to have the most valuable or vulnerable intruder be the last to enter. 
This possibility is responsible for the minefield planner’s interest in the threat pro- 
file, particularly for intruders late in the sequence. One method for increasing the 
threat to late intruders is the use of probability actuators. When a mine receives 
signals that it interprets as the presence of a target, it detonates with a probability 
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that is built into the actuation mechanism. Quantitatively, the effect of this can be 
seen by simply reducing the parameter A in the spreadsheet. The effect will be to 
decrease the threat to the first intruder, but to increase the threat to late intruders. 

A minefield planner usually has multiple concerns that are not easily put on a 
single scale of utility. UMPM is therefore a descriptive decision aid in the sense 
that it evaluates a given decision through various graphs and statistics, rather than 
producing a decision that is “optimal” according to a single criterion. This is true 
of the US Navy’s version of UMPM, as well as sheet “UMPM” of Chapter8.xls. 

The US Navy’s UMPM is not a Monte Carlo simulation, but it could be. 
Sheet “MonteUMPM” of Chapter8.xls includes a command button that performs a 
Monte Carlo simulation to estimate the casualty distribution. None of the UMPM 
sheet’s numerical integrals are necessary, so the MonteUMPM sheet looks much 
less cluttered than the UMPM sheet. You are probably using a computer where 
10,000 replications can be performed in the blink of an eye, so the computations 
could certainly be made accurately in the time available for planning a minefield. 
The threat profile is not calculated, but easily could be. To see the Visual Basic 
code that performs the simulation, look at subroutine Monte() in the attached 
module. 

The MonteUMPM sheet makes exactly the same assumptions as the UMPM 
sheet in order to facilitate comparisons between the two. However, the real advan- 
tage of Monte Carlo simulation, and our reason for bringing this subject up, is the 
ease with which alternative assumptions could be implemented. In subroutine 
Monte(), intruders could be assigned properties such as a type or navigation error, 
and mines could have properties such as depth, type, sensitivity, or orientation. 
This easy extensibility is not true of analytic methods such as the Markov chain 
calculations that he behind UMPM. As each realistic detail is incorporated, ana- 
lytic models will be able to avoid excessive complication only by employing the 
Universal Independence assumption in circumstances where it is false. An in- 
truder’s type or navigation error, for example, while random, is not independently 
random for every mine in the minefield. This pervasive temptation to falsely em- 
ploy UI does not beleaguer Monte Carlo simulations and is one of the main attrac- 
tions of the technique. 



8.4 Minefield Clearance 



It takes much longer to clear a minefield than it does to make one, especially if the 
minefield is designed in the first place to make clearance difficult. Part of the dif- 
ficulty is that the clearance forces do not know certain minefield characteristics 
that are concealed by the minefield planner: the minefield location and dimen- 
sions, the number and type of each mine employed, the location and sensitivity of 
each mine, and possibly other mine settings. One consequence of this uncertainty 
is that minefield clearance is dangerous. Another is that it is difficult to tell exactly 
when a minefield is “clear” of mines or when to stop clearing. 
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There are basically three methods for clearing a minefield: destruction, hunt- 
ing, and sweeping. Destmction is simply applying sufficient lethal force to destroy 
all the mines, regardless of their type or settings. This is simple in concept, but of- 
ten expensive in terms of monetary cost and environmental degradation because 
the lethal force must be applied to every part of the minefield’s area. We will not 
consider it further. With regard to the other two methods. 

Definition: One sweeps for mines by attempting to cause the mine’s sensors to 
detonate the mine in circumstances where the detonation is harmless. If the mine 
is located by some means not involving its own sensors, and then either destroyed 
or avoided, one is instead hunting. 

Both sweeping and hunting are used in practice. Sweeping is very efficient 
when it works, since it is not subject to false alarms and mines are disposed of 
automatically. However, sweeping is subject to mine counter-countermeasures 
such as sensitivity adjustments, multiple sensors, probability actuators, and in- 
creasingly sophisticated signal processing algorithms that can distinguish sweep- 
ing signals from signals emitted by real targets. Hunting is vulnerable to none of 
these, but is affected by false alarms, decoys, and signature reduction. An example 
of the latter is the occasional burial of mines laid on the bottom of the ocean, 
which leaves them effective while disguising them from sonar. Hunting also bears 
the necessity of destroying or otherwise rendering ineffective any mines that are 
found. 

Sweeping and hunting are both essentially search problems in the sense that 
the object is to “find” all the mines in one maimer or another. Minefield clearance 
might even be treated as an exhaustive search problem - after carefully going over 
the minefield once, one would declare the minefield to be “clear.” However, there 
are many reasons why exhaustive search might not be possible. In addition to the 
reasons recounted in Section 7.3.2, in the present context we can add the possibly 
unknown effects of counter-countermeasures such as probability actuators. Re- 
gardless of the amount of sweeping, it is always possible that a mine with a prob- 
ability actuator is still there and functional. As a result, minefield clearance mod- 
els generally reject the concept of exhaustion, reporting “clearance level” instead. 

Definition: The clearance level of a minefield is the probability that a typical 
mine has been cleared. Equivalently, it is the (average) fraction of the mines that 
have been cleared. If there are multiple mine types, there may be multiple clear- 
ance levels. 

Historically, the United States Navy has relied on a pair of clearance level 
models called NUCEVL (Non Uniform Coverage EVaLuator) and UCPLN (Uni- 
form Coverage PLaNner). The former is descriptive (the sweeping plan is an in- 
put, and the clearance level is output), while the latter is prescriptive (the desired 
clearance level is an input, and the optimal plan among those with constant track 
spacing is an output). Both of these are incorporated in the Mine Warfare and En- 
vironmental Decision Aids Library (MEDAL). They are also utilized in the North 
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Atlantic Treaty Organization (NATO) decision aid MCM EXERT (Redmayne, 
1996), in conjunction with the Decision Aid for Risk Evaluation (DARE, Bryan, 
2006). Many other minefield clearance models have been proposed and at times 
utilized, two of which are COGNIT (McCurdy, 1987) and MIXER (Washburn, 
1996). Pollitt (2006) gives a historical review. Some of these models deal with 
concepts other than clearance level, as should be expected of a subject where there 
is room for disagreement about goals. 

The rest of this section consists mainly of discussing some of the important 
questions that must be answered before any new model of minefield clearance can 
be developed. For brevity we will refer only to sweeping in the rest of this section, 
but we do not mean to exclude hunting. After considering these questions and the 
differing ways in which the models mentioned above have dealt with them, a pro- 
totype minesweeping model will be described in Section 8.4.7. 



8.4.1 Are Clearance Forces Vulnerable? 

The Wonsan minefield mentioned in the introduction was unusual in its lethality 
to minesweepers; it is not unusual for minesweeping forces to take no casualties at 
all in the process of clearing a minefield. This realistic experience is one good rea- 
son for not including the possibility of minesweeper casualties in a minesweeping 
model. Other reasons are that any attempt to do so will force a need for data to 
support minesweeper vulnerability, and that an additional goal (avoiding mine- 
sweeper casualties) will have to be considered. There are thus some good argu- 
ments in favor of models where minesweeper casualties are not represented. 

On the other hand, the typical real world absence of minesweeper casualties is 
no accident, but rather the result of carefully considered procedures for clearing 
minefields safely. If the minesweeping model does not represent even the possibil- 
ity of minesweeper casualties, then any attempt to optimize tactics might produce 
tactics that, while efficient in terms of time and clearance level, are dangerous to 
minesweepers. A modeling dilemma results, the solution to which will depend on 
context and available data. Among the models mentioned above, NUCEVL, 
UCPLN, and MCM EXPERT do not include minesweeper casualties, whereas 
COGNIT and MIXER do. 

There is one additional issue if minesweeper casualties are included, and that 
is the influence of casualties on the ability of the minesweeping force to carry out 
its plan. The simplest assumption is that all minesweeper casualties are instantly 
replaced by a miraculous mechanism. Under this assumption, the number of mine- 
sweeper casualties is computed and included in any optimization objective, but the 
minesweeping plan is carried out regardless of casualties. The advantage of this 
simplification is that analytic models are still possible and are simple enough to 
permit optimization of tactics. COGNIT makes this assumption, and so does the 
optimization part of MIXER. The alternative is to include the effects of mine- 
sweeper casualties on the minesweeping plan, including the possibility that the 
plan might have to be prematurely terminated due to lack of equipment. This addi- 




8.4 Minefield Clearance 



171 



tional realism is likely to force the model to become descriptive, rather than pre- 
scriptive. MIXER includes a Monte Carlo simulation that functions in this mode. 
There are thus three reasonable stances for a model to take with regard to mine- 
sweeper casualties: 

• Assume that there are none, as in NUCEVL, UCPLN, and MCM 
EXPERT. 

• Assume that casualties are instantly replaced, as in COGNIT and the 
optimization part of MIXER. 

• Assume that casualties affect clearance, as in the Monte Carlo part of 
MIXER. 



8.4.2 Is Clearance Level a Sufficient Output? 

After a clearance operation, a potential intruder might not be satisfied to know 
only the clearance level. He might instead like to know the threat, the probability 
that he will become a casualty if he attempts penetration. Any statement about 
threat requires some knowledge of the number of mines remaining, which in turn 
requires some knowledge of the number of mines M present in the first place. The 
trouble is that the clearance forces usually do not know M, even after clearance is 
finished. We therefore have another dilemma for the modeler. The advantage of 
restricting effectiveness measures to clearance level is that no knowledge of M is 
required, and the disadvantage is that potential intraders would rather hear about 
threat. This dilemma is not present in minefield planning, where M is known and a 
threat profile is a customary output measure (Section 8.3). However, it is very 
much present in minefield clearance. 

COGNIT and MIXER each require an estimate of M in the form of a prob- 
ability distribution, prior to computing threat. The need for such a distribution will 
not be welcomed by the clearance planner, who will often have no idea how many 
mines are present. An alternative to requiring a user input is to build in a distribu- 
tion that is not controlled by the user. MEDAL and DARE estimate threat based 
on the assumption that all numbers of mines are equally likely. NUCEVL and 
UCPLN confine themselves to clearance level and therefore do not require any as- 
sumptions about M. 



8.4.3 Is Clearance a Sequential Process? 

Information about the minefield is obtained in the process of clearing it, and the 
decision as to whether to continue clearance or not might depend on results 
achieved to date. After several days of clearance with no mines found, for exam- 
ple, one might reasonably terminate a clearance operation prematurely. There is an 
optimal stopping theory that is applicable to such problems, but, to the authors’ 
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knowledge, this theory has never been applied to minefield clearance. All of the 
tactical decision aids mentioned above are “one off’ in the sense that optimized 
clearance effort has a fixed time length. 

An intermediate position is to plan fixed-length clearance campaigns, but to at 
least provide a summary of minefield status at the “end” of clearance in case an- 
other fixed-length campaign is required. MIXER, MEDAL, and DARE do this, 
providing the distribution of the number of mines remaining at the end of clear- 
ance. 



8.4.4 Are There Multiple Mine Types? 

Some models capable of dealing with one type of mine can be easily modified to 
deal with several. For descriptive, clearance level models such as NUCEVL, it is 
merely a matter of doing separate calculations for each type, reporting the clear- 
ance level for each one. As long as the numbers of mines of each type are inde- 
pendent, the total minefield SIT can also be obtained by simply powering up the 
individual SITs. MIXER considers multiple mine types, including the possibility 
in the Monte Carlo part that mines of one type can affect the clearance of another 
type. DARE deals with the possibility that mines might be cleared without reveal- 
ing their type, but the other models mentioned above do not. Monach and Baker 
(2006) present a Bayesian method that applies. 



8.4.5 Are There Multiple Sweep Types? 

In many cases there are a variety of assets available for clearing a minefield. 
Clearance of maritime minefields has historically been done mainly by manned 
ships, but several countries are currently using remotely controlled vehicles, and 
the US Navy also employs helicopters. These different assets usually do not work 
simultaneously in the same minefield, since there is a danger that one asset will 
actuate a mine that will damage another. Several questions therefore arise. There 
is, of course, the division of effort problem - given a fixed time available for 
clearance, how should it be split between the various assets? If the vehicles are 
themselves vulnerable, there is also an order-of-entry problem. If helicopters are 
available, they will typically enter first because they are least vulnerable, but oth- 
erwise the best order of entry may not be obvious. These additional considerations 
greatly complicate the task of constructing an “optimal” minesweeping plan. 
MIXER alone deals with this issue. 
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8.4.6 Still More Questions 

Is the minefield rectangular? If so, will all of the clearance effort consist of paral- 
lel transits? Is it possible that intruders will abandon penetration attempts if initial 
casualties seem too high? If counter-countermeasures are contemplated, should the 
situation be thought of as a game, as in Section 8.5 below? All of these questions 
have been answered in different ways by different models of minefield clearance. 
Rather than pursue them, we turn instead to the formulation of one possible “new” 
model. 



8.4.7 A Prototype Minesweeping Optimization ModeUOptSweep 

In this subsection, we provide one set of specific answers to the questions posed 
above, and develop the implied model. Specifically, we assume 

• no minesweeper casualties 

• the objective is to minimize SIT, the threat to the first intruder 

• clearance is one off, rather than sequential, and 

• there are multiple mine types and sweep types 

Since the objective is to minimize SIT, some assumption about the number of 
mines of type i needs to be made. We assume that the number of mines present is 
a Poisson random variable with mean or, for each of the m types of mine present. 
We also assume that each remaining mine of type i will provide a known threat of 
t, to the first intruder. Thus, for each type of mine under consideration, the user 
must provide two numbers: a; and t,. 

In clearing the minefield, the user employs Xj sweeps in configuration j, with 
the variables Xj being the decision variables in this problem. A configuration might 
be a helicopter towing a sled that is set to actuate magnetic mines or it might be a 
ship towing an underwater device that cuts the mooring cables of moored mines. It 
could be 4 hours of grazing by a herd of goats, if the minefield were on land. Each 
sweep in configuration j is assumed to have a small probability Wy of independ- 
ently removing each mine of type i, and we assume that there are n sweep configu- 
rations available. The average number of times that a mine of type i is removed is 

then y. = ^ W^Xj . Since there are many attempts at removal, each of which suc- 

7=1 

ceeds with a small probability, we take the probability that the mine survives all 
attempts to remove it to be exp(-y,), the Poisson probability that the number of 
removals is 0. The average number of mines that survive all clearance attempts is 
then «, exp(-y,.) , and the average number of lethal mines seen by the first in- 
tmder is 
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z = a,f, exp(-j|) + . exp(-j^^^). (8.5) 

Since the sum of independent Poisson random variables is itself a Poisson 
random variable, the number of lethal mines seen by the first intruder is itself a 
Poisson random variable, and SIT is 1 — exp(— z). This is the quantity to be mini- 
mized, but z can be minimized directly because 1 — exp(— z) is an increasing func- 
tion ofz. 

SIT can be made as small as desired by making all of the Xj very large, but we 
assume that there are resource constraints that prohibit this. Each helicopter sweep 
consumes helicopter hours, for example, and there might be only so many helicop- 
ter hours available within the time allotted for clearance. There might also be con- 
straints on other vehicle types, or on consumable commodities that are available in 
limited supply, or on the overall time available for clearance. To complete the 
formulation as a minimization problem, we must express these constraints mathe- 
matically. 

Let hjk be the amount of resource k consumed by one sweep of type j, and let 
there be units of resource k available over the minesweeping period. If there are 
K types of resource available, then the problem of optimally selecting (xi,...,x„) re- 
duces to the following optimization problem: 

m 

minimize z ^ ^ a.t. exp(- j. ) 

/=i 

n 

subject to V. = V 

U (8.6) 

'^h^,Xj<H,-,k = \,...,K, 

andx^ > 0; 7 = 

This problem has linear constraints and a nonlinear, but convex, objective func- 
tion. As long as the variables x and y are not restricted to be integers, this is a rela- 
tively simple kind of minimization. Sheet “Optsweep” of workbook Chapter8.xls 
uses Excel’s Solver to solve the problem of allocating 4 resources to 7 sweep 
types in order to clear 5 different types of mine. The reader may wish to experi- 
ment with it (see Exercise 8). 



8.5 Mine Games 



The people who manufacture and employ mines are well aware that attempts will 
be made to clear their minefields. This has lead to a variety of counter- 
countermeasures designed to make clearance difficult. Mine hunters looking for 
metal objects can be foiled by making mines nonmetallic or by introducing cheap 
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metallic decoys. Sensors can be used in groups to complicate simultaneous actua- 
tion. An anti-tank mine poses little threat to an individual human, but a field 
of such mines may have occasional antipersonnel mines interspersed to make the 
anti-tank mines dangerous to locate. Mines can incorporate timers and counters 
that delay the mine’s action, hoping to delay it until right after the expected clear- 
ance campaign is over. All of these actions are taken in anticipation of an action 
by a sentient enemy, so it is natural to apply the theory of TPZS games (Chapter 6). 
This section examines three of the many possibilities. 



8.5.1 The Analytical Countered Minefield Planning Model 
(ACMPM) 



There is a natural tendency for minefields to become less effective with time on 
account of sweeping and transits by intruders. Mine counters are one possible tac- 
tic for reducing this tendency. A mine on count j will detonate when actuated only 
if 7 = 1 ; otherwise, each actuation will decrease j by 1 until the mine is finally 
“ripe” (j = 1). By mixing up the counts of the mines, the minefield planner can 
achieve a minefield that is threatening for late intruders, as well as for early ones. 
The problem of determining the ideal mixture of counts is a good candidate for a 
computerized tactical decision aid. We will describe the planning problem in de- 
tail for a minefield with only one type of mine. 

Suppose that intruders are all alike as far as actuating mines is concerned, but 
that one of N intruders (the “chief’) is more important than the others. It will nor- 
mally be in the interest of the intruders to send the chief in last, but the object of 
the minefield planner is to threaten the chief regardless of where he appears in the 
sequence. If t„ is the threat to the nth intruder, let t be the smallest of all these 
numbers. Since the chief may be anywhere in the sequence, the object is to make t 
as large as possible by cleverly setting the counts of the mines. We are in effect 
considering a game where the intruders move last, since they are assumed to know 
the count distribution. 

Suppose further that every intruder actuates each mine independently with 
probability A, and that the chief will be damaged with probability D, conditional 
on actuating a ripe mine. A mine initially set on count j will be ripe just before the 
n transit if and only if the first n-l intruders actuate it exactly j-\ times, a bino- 
mial probability. If Pj„ is this probability, then 



Note that Pu = \, since the number of combinations of 0 things taken 0 at a time is 
by definition 1. If the chief is intruder n, he will be damaged by this mine if 
and only if it is ripe, and if the chief both actuates it and is damaged by it. The 




(8.7) 
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probability of this is ADPj„. If Xj is the number of mines set on eount j, and if all 
mines act independently, then the chiefs probability of surviving all mines is 



1 - = n (1 - )"' ; 1 ^ ^ (8.8) 

./=i 

The upper limit of the product in (8.8) is n because mines initially on counts ex- 
ceeding n cannot threaten the nth intruder. If there are a total of M mines avail- 
able, then (8.8) is a product of at most M factors, each of which is the probability 
of surviving one mine. 

We can now consider the problem of maximizing t, the minimum of all the 
numbers t„, subject to the constraint that the variables Xj must not sum to more 
than M. See sheet “ACMPM” of Chapter8.xls for an implementation. Solutions 
can be surprising. One might think that there would have to be some mines on 
high initial counts to guard against the possibility that the chief is late in the se- 
quence. This turns out not to be true when A is substantially smaller than 1 . If ^ is 
0.2, for example, it would not be unusual to have a mine initially on count 10 still 
be on count 10 after several transits. If A is small enough, in fact, the best tactic 
will be to put all mines on count 1 . It is only in situations where actuation is likely, 
possibly because most intruders are actually minesweepers, that advanced counts 
become attractive (see Exercise 6). 

The Analytical Countered Minefield Planning Model (ACMPM) is a US 
Navy program used to design countered minefields (Bronowitz and Fennemore, 
1975) that avoids some of the artificial assumptions made above. The UI assump- 
tion made above that “every intruder actuates each mine independently” is a bad 
one in the face of channelization, as we pointed out in Section 8.3. ACMPM in- 
stead makes the UMPM assumption of conditional independence. ACMPM also 
deals with a variety of mine types simultaneously and includes resource con- 
straints other than simple constraints on the number of mines. In other words, the 
real ACMPM improves and generalizes the calculations made on sheet 
“ACMPM” ofChapter8.xls. 



8.5.2 Triangular Sweeping 

In this subsection we exclude all countermeasures other than sweeping. Once the 
minefield is discovered, the minesweeping forces randomly select a navigation 
channel through it. Only the navigation channel will be cleared by sweeping, but 
even within the channel there is a guessing game going on. 

A mine that is swept sufficiently often will detonate uselessly, but a mine on a 
high count may also be useless if it is not swept at all, since it may never ripen un- 
til all intruders have passed. The dangerous mines are those with an initial count 
just slightly greater than the number of sweeps encountered. A kind of guessing 
game ensues, since the miner would like to guess the number of sweeps so that he 
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can set the mine counter to be slightly larger than that. To make guessing difficult, 
the sweeper should be unpredictable in the amount of his sweeping. One way of 
doing this would be to partition the minefield into equally sized subregions, with 
the number of sweeps depending on the subregion. If the subregions are numbered 
0,1,2,..., and if the number of sweeps in each subregion is the same as its index, 
then we have “triangular minesweeping.” Figure 3 shows an example where the 
navigation channel is broken up into four subregions, all of which must be trav- 
ersed by any intruder. Triangular minesweeping is optimal in a sense that we will 
now make precise. 



Channel 




Figure 3: A minefield with a randomly selected navigation channel (speck- 
led) broken up into four equally sized subregions labeled with a random 
permutation of (0,1?2,3). Each subregion is swept a number of times that is 
equal to its index. 



It is convenient to temporarily change the definition of ripeness so that a mine 
is ripe when its count is 0, rather than 1 . Let v, be the fraction of mines that are ini- 
tially set to count and let y, be the fraction of the minefield that is swept i times, 
both defined for / > 0. Since the mine counts and the sweeps are determined inde- 
pendently, and since a mine will be ripe after sweeping if and only if the count 
agrees exactly with the number of sweeps, the probability that a randomly selected 
mine will be ripe after sweeping is 



/=0 



(8.9) 



The sweeper wants to minimize p with y, and the miner wants to maximize p with 
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X. We suppose that the average number of sweeps per subregion is limited toy, the 
coverage ratio (Section 7.3) permitted by the time available for sweeping, and that 
both sides knowy. The sweeper’s strategy y is therefore subject to the constraint 

that ^ (y, < y , in addition to being nonnegative and summing to unity. 

1=0 

For simplicity, suppose thaty is a multiple of 0.5, in which case 2y is an inte- 
ger N. Under that supposition, we can find a saddle point for the game. An optimal 
strategy for the minesweeper is to construct A -f 1 subregions, and to make 

N 

y,. =1 /(A -1-1);; = 0,..., A. Since = A(A -t- 1) / 2 , this makes the average num- 

1=1 

her of sweeps per subregion be exactly A/2, which is y, so the strategy y is feasi- 
ble. Furthermore, regardless of x, as long as x is a probability distribution, 

P = T ^iTi = Z ^1 + 1) = 1 /(A^ + 1) , (8.10) 

/=0 /=0 

so the sweeper can guarantee that p is exactly l/(A-l-l), regardless of how the 
counters are set. 

An optimal strategy for the miner is to set x. = — — ; ; = 0, . . ., A- 1, and 

‘ A(A-i-l) 

Xi = 0 for i > A. The miner needs to know nothing about subregions in order to do 
this - the mines can be scattered randomly over the whole minefield. If the miner 
sets the counts according to x, then, regardless of y, as long as y is a probability 
distribution whose mean does not exceed A/2 



■V "V 2(A-;') 2 

P = / x.y, > > y, = 

U ■^A(A-H)^' A(A + 1) 



(A-Z'Ti)^ 



A + 1 



(8.11) 



We thus see that the miner can guarantee that p is at least l/(A-l-l). Since both 
sides can guarantee the same value, the strategies are optimal, and the value of the 
game is 1/(A+1). 

Example 2: Suppose that the navigation channel is a square with side 10 km, with 
area^ = 100 km^. The sweeper has two minesweeping vehicles available for r= 5 
days, each of which is available 15 h per day. The sweeping speed is F= 10 km/h, 
and the sweepwidth is 1F= 100 m. Since there are two sweepers, the coverage ratio 
2VWT 

is y = = 1.5 and A = 3. The sweeper divides the minefield into four equal 

A 

parts, secretly and randomly numbered 0, 1,2, 3, as shown in Figure 3, sweeping i 
times in part i. The miner sets each mine to count (0, 1, 2) with probability (3/6, 
2/6, 1/6). The value of the game is 0.25, so 25% of the mines will be ripe after 
sweeping. The threat of the resulting minefield depends on the total number of 
mines used, of course, but the sweeping plan and the mine count setting do not. 




8.5 Mine Games 



179 



The sweeper could get rid of all mines by sweeping all four regions three times, 
but that would take 10 days, rather than the five that are available. 

The value of this game decreases sufficiently slowly with N that the sweeper 
may have second thoughts about sweeping as a countermeasure. It is tempting to 
hunt instead, since hunting is not affected by mine counts. Hunting would also 
spare the sweeper the necessity of explaining to his superiors why he plans to di- 
vide an apparently uniform minefield into subregions, all of which are treated dif- 
ferently. There is an explanation (see above), but it is not intuitive. 

When the miner uses x, the probability that a mine is ripe after i actuations 
decreases linearly until it becomes 0 after N actuations. If each mine instead were 
to contain a probability actuator that detonates the mine with probability A after 
each actuation, then x, would be A(1 - A)‘ for ; > 0, a geometric distribution that 
also decreases to 0, but only asymptotically. While there is no known sense in 
which the geometric distribution is optimal, there are still some arguments in its 
favor. One is its ease of implementation, since all that is required is a probability 
actuator. Another is the likely falsity of the assumption made above that the miner 
knows N. If N is actually unknown to the miner, the effect should be to “slop ouf ’ 
the linear decrease into something that looks more geometric. 



8.5.3 lED Warfare 



Mines are often used by a miner who is about to lose control of the battlespace, if 
only temporarily. The miner acts, then loses control of the battlespace, then the 
sweeper acts to clear the mines, and finally intruders pass through the minefield. 
This is the “one-off’ point of view taken in all previous parts of this chapter. In 
this section we consider a different point of view where the creation and destruc- 
tion of a minefield proceed simultaneously over an indefinitely long period of 
time, with neither side controlling the battlespace to the extent that activities by 
the other side cease. The motivation for this is the kind of warfare conducted on 
the road network of Iraq in the first years of the current millennium, so we will 
adopt terminology appropriate to that application. The essence of this problem is 
that there are many possible places (road segments) for mines, and both sides allo- 
cate resources to these segments. Each mine placed on a road segment either deto- 
nates as intended by the miner or is found and cleared before it has a chance to do 
so. In the one-off view there is also the possibility that neither of these events will 
happen, but that the mine will instead be left over after all intruders have passed. 
We omit this third possibility on account of the long time periods involved. “In- 
truders” in this kind of warfare are generally compact groups of vehicles, referred 
to below as “convoys.” 

Let i be an index for road segment, and let b, be the traffic level on segment i, 
the rate per unit time at which convoys travel over the segment. The probability 
that a mine on segment i will be actuated by a specific convoy is a„ which may be 
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less than 1 because of various countermeasures carried by convoys. A mine that 
actuates (detonates) will be successful with probability c„ where “success” means 
significant damage to the convoy. All three of these parameters (a„ and c,) are 
assumed to be known to both sides, for all segments. 

The sweeper has a total number of sweeping teams y that he allocates to the 
road segments, with y, being the number allocated to segment i. Each team allo- 
cated to segment i removes each mine on that segment at rate j3i, independently of 
other mines and teams. The sweep effectiveness parameter P, depends on the 
speed of the sweeping teams, the length of the road segment, and other parameters 
that need not concern us - the main point is that the rate of removal is proportional 
to the number of teams assigned, with pi being the proportionality constant. The 
total rate of removal for each mine on the segment, by sweeping teams, is 
then P^y^ . Mines are also removed by convoy traffic when the mines detonate, 
whether successfully or unsuccessfully, but we have yet to address the rate at 
which that happens. 

We assume that sweeping and convoy traffic are independent Poisson proc- 
esses, so there is no way for the miner to predict when the next convoy or sweeper 
will come by. In particular, the miner cannot wait until he observes a sweeping 
team come by, and then rush out to place a mine on the road before the immedi- 
ately following convoy arrives. 

A small digression is appropriate at this point. One might argue that a better 
way for the sweepers to operate would be to directly precede each convoy by a 
sweeping team; that is, one might argue that the sweeping teams should serve as 
escorts. That way the convoys would always be moving over road segments that 
have just been cleared. In fact, the assumption of independence between convoy 
traffic and clearance may strike the reader as an example of the Universal Inde- 
pendence habit that he was warned about in Chapter I . However, there are some 
realistic arguments in favor of assuming independence in this case. Among these 
are 

• The sweep teams might be slower than the convoys. Of course, it would 
always be possible to have the sweep team leave before the convoy, but 
then there would be an exploitable gap between the two. 

• There might not be enough sweeping teams or they might be stationed at 
different places than where convoys originate. Requiring teams to do es- 
cort duty might waste some of their time. 

• Sweeping teams might not remove all mines as they pass over a road 
segment, and therefore cannot guarantee safety even if they do directly 
precede a traffic unit. 

A similar issue arose concerning the escort of convoys in the Battle of the Atlantic 
in World War II. A given fleet of allied ships could either escort convoys or oper- 
ate independently of them, prowling the ocean looking for submarines to sink. 
Both types of tactic were found to be useful at different times in that battle. Here 
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we are ignoring the possibility of escort, assuming that convoys and sweeping 
teams operate independently. 

We now return to an analysis based on the independence assumption. The to- 
tal rate at which a mine on segment / is removed is the sum of the re- 

moval rates due to traffic and sweeping. The proportion of this due to convoys 

is . This proportion is also the probability that a mine placed on seg- 

Uibi + 

ment i will be detonated by a convoy before it is removed by a sweeping team (see 
“race of exponentials” in Appendix A). When multiplied by c„ this is the probabil- 
ity of success for a mine on segment i. 

Now let Xi be the probability that a given mine will be placed on segment i, so 

Z ahcx- 

— ' ' ' ' — . The 
afi,+j3,y, 

miner would like to choose x to maximize this probability, and the sweeping 
teams would like to choose y to minimize it. This is a logistics game (Section 
6.2.3), so it has a saddle point whose value can be found by solving a minimiza- 
tion problem. See Exercise 7. 

Note that ^(x,y) is a probability per mine. Casualties to convoys will be pro- 
portional to the number of mines that are placed on the roads, no matter what the 
clearance forces do. Given the assumptions made above, the clearance forces can 
only minimize the proportionality constant. 

This analysis can be generalized in several ways. For example, one might take 
the origin-destination traffic levels to be given, rather than the segment traffic 
levels, and let convoy routing be part of the sweeper’s problem. Some of these 
generalizations are explored in Washburn (2006). 
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Exercises 

(1) Suppose J^=20 m and 5= 1000 m. Using (8.1), how many mines are needed 
to make SITbe at least 0.2? 

Ans. 12. 

(2) Make a spreadsheet that will answer questions such as in Exercise 1. The 
user inputs W, B, and the desired value of SIT, and the required number of 
mines is calculated. 

Hint: N can be a ratio of logarithms, rounded up. 

(3) Suppose there are three mines in a minefield of width 1000 m, each with a 
definite lethal radius of 100 m and an actuation probability of 1.0 within that 
radius. If two intruders attempt penetration of the minefield, what is the 
probability that both are killed? Compare the ENWGS and UMPM answers. 
Ans. Since ?? = 1000 m and 200 m, the answer for ENWGS is 
(0.488)(0.36) = 0.176. For UMPM, first note that R„* = 0.2 for all n. After 
raising the transition matrix to the third power, the answer is 0. 104. Alterna- 
tively, the probability that the first mine kills an intruder, the second does 
not, and the third one does is (0.2)(0.8)(0.2). There are two more ways in 
which both intruders might be killed by three mines, and the sum of all three 
probabilities is 0.104. 

(4) Suppose that a certain type of mine has a definite lethal radius of 7? = 100 m. 
The mine will always actuate if an intruder comes within 50 m, but the ac- 
tuation probability is only 0.5 if the closest point of approach is between 50 m 
and 150 m. The minefield width is B ^ 1000 m. Using (8.3), find a fonuula 
for??,,*. 

Ans. R„* = 0.2- 0.1 X (0.5)”; « > 0 . Even if a very large number of intruders 
attempt penetration, the probability that a given mine will kill one of them 
does not exceed 0.2. Mines outside of a 200 m channel cannot kill anything, 
even if actuated. 

(5) Suppose you are designing a minefield using sheet “UMPM” of Chap- 
ter8.xls. The minefield width is 7? = 500 m, the shape factor for the mines is 3, 
the lethal radius is 50 m, and there are 22 mines in total. Those parameters 
are fixed, but you can change the scale factor up or down from its default 
value of 30 m by adjusting the sensitivity of the mine’s sensor. You can also 
adjust the actuation probability, but only downward from its default value of 
1/3. You are concerned about two quantities: the average number of intrud- 
ers killed out of 10, and the threat to the last (tenth) transitor. Can you find 
any way to adjust the two controllable parameters from their default values 
that improves both measures? 



8.5 Mine Games 



183 



Ans. There are many ways to do it. If you change the scale factor to 35 m and 
the actuation probability to 0.3, the average number killed improves from 3.59 
to 3.71, and the threat to the last transitor improves from 0.135 to 0.143. 

(6) Worksheet “ACMPM” of ExceF"^ workbook MineWar.xls implements 

Equations (8.5) and (8.6) and invites you to determine the best count distri- 
bution for a given number of mines. See if you can find the distribution that 
maximizes the minimum threat when 20, 10, A = 0.4, and D = 0.3. 

You may wish to take advantage of Excel’s Solver feature, which is set up to 
find the best distribution if the requirement that the number of mines on each 
count must be an integer is ignored. You may also wish to experiment with 
smaller values of A such as 0.1, in which case the benefits of advanced 
counts should be much smaller. 

(7) Consider the lED problem described in Section 8.5.3, with five road seg- 
ments and the data a = (0.1, 1, 1, 1, 1), b = (2, 4, 3, 2, 4), 
c = (0.1, 0.2, 0.3, 0.4, 0.5), and P = (8, 7, 6, 5, 4). If five teams are available, 
how should they be split among the five segments, and what is the resulting 
fraction of successful mines? Answer the question by completing the spread- 
sheet begun on sheet “lED” of Chapter8.xls, where the above data are re- 
corded, and possibly consult sheet “Logistic” of Chapter6.xls. 

Ans.y = (0, 0.35, 0.71, 0.89, 3.04), and 0.12 of the mines are successful. Most 
of the sweep teams are assigned to segment 5, which has lots of traffic and 
unfavorable values for Cs and Ps. No teams are assigned to segment 1, where 
the opposite is true and there is in addition a low actuation probability. 

(8) Consider the optimization problem formulated on sheet “OptSweep” of 
Chapter8.xls. Study it to make sure that it corresponds to the formulation 
given in Section 8.4.7. When Solver is employed, the best feasible allocation 
of sweeps should produce z = 0.2418, which corresponds to an SIT of 
0.2148. If the number of sweeps of each type is forced to be an integer, as 
Solver permits (you need to add a constraint where the adjustable cells are 
required to be integers), the minimized z should increase. Exactly how much 
does it increase? The problem being solved is not of a type where Solver 
guarantees to find the optimal solution (note Solver’s careful characteriza- 
tion of its solution). Can you find a better solution than Solver’s? 

Ans. In the Solver that comes with ExceF’^ 2003, z increases to 0.2561 when 
X is required to consist of integers, using the optimal non-integer starting 
point. The “optimizing” x is (16, 14, 0, 1, 0, 0, 25). The authors do not know 
whether there is a better solution. 




Chapter 9 

Unmanned Aerial Vehicles 



No operational commander 
should have to assign a soldier a 
task that could be done as well 
by a computer, a remote sensor, 
or an unmanned airplane. 

Richard Perle 



9.1 Introduction 



This chapter is concerned with Unmanned Aerial Vehicles (UAVs). A UAV is a 
remotely piloted or self-piloted aircraft that can carry a payload of cameras, sen- 
sors, communications, and electronic warfare equipment. A UAV may carry also a 
weapon, in which case it is called an Unmanned Combat Aerial Vehicle (UCAV). 
UCAVs are effective attack weapons. Typical missions of UAVs are surveillance, 
reconnaissance, target engagement, and fire control for other long-range weapons. 
UAVs vary in design, size, capabilities, and endurance. Small and lightweight 
UAVs, with limited endurance and flying range are used for close-range surveil- 
lance and reconnaissance by tactical units such as infantry battalions and special 
operations teams. Larger and heavier UAVs, with higher endurance and longer 
range, are used for longer reconnaissance missions such as gathering operational- 
level intelligence. At the far end of the line of UAVs stand very large vehicles that 
weigh several tons, can endure continuous missions of 24 h and more, and fly at a 
very high altitude (up to 20 km). The flying range of such UAVs is thousands of 
km, and their primary use is to provide wide area coverage for strategic informa- 
tion-gathering missions. 

In this chapter we consider two types of UAV problems: routing UAVs in re- 
connaissance or search missions, and evaluating the effectiveness of UCAVs in at- 
tack missions. Section 9.2 presents models for optimizing the routes of UAVs in 
two types of missions, while Section 9.3 describes probability models for evaluat- 
ing the effectiveness of UCAVs. 

UAVs have advantages in not risking a human pilot, and in omitting all the 
life support systems that a human demands. It does not follow that UAVs are ex- 
pendable. While small UCAVs are expendable by design, other larger and expen- 
sive UAVs, such as the Predator or Global Hawk, cost millions of dollars and are 
designed to execute many missions. Losing such a UAV has fiscal, as well as tac- 
tical consequences. The resulting aversion to loss explains the emphasis on sur- 
vivability in the next section. 
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9.2 Routing a UAV 



A mission plan for UAVs concerns three main issues: locating the deployment site 
of the UAVs’ ground control units, routing the vehicles in the area of interest, and 
scheduling their flights. Three factors affect the UAVs’ mission plan: 

• objective 

• measure of effectiveness (MOE) 

• operational, technical, and logistical constraints 

The objective of the UAV mission depends on the operational setting and re- 
quirements. For example, short-range mini-UAVs, which carry a light payload 
such as a low-resolution short-range electro-optical sensor, may be operated by an 
infantry battalion to gather tactical intelligence regarding enemy units at the other 
side of the hill. Another possible scenario is where a team of Special Operations 
Forces (SOF) uses such UAVs to search and detect insurgents or terrorists in 
mountainous areas. Larger, high-endurance UAVs may be used for persistent 
open-ocean and littoral surveillance of small vessels over extended periods. 

The MOE that measures the success of attaining the objective depends on the 
operational setting. In the SOF scenario a reasonable MOE is the probability of 
detecting the target (terrorist). In scenarios of persistent surveillance over hostile 
area the MOE may be the expected duration of the surveillance mission or the 
probability that the UAV completes its mission unharmed. 

Several operational, physical, and logistical constraints must be taken into 
consideration when planning UAV missions. Operational constraints include no- 
fly (or high-risk) zones, limited time windows for flying, deconfliction among 
UAVs that share the same airspace (that is, making sure that multiple UAVs do 
not collide in midair), and limited available deployment sites for ground control 
units (GCU) that control the flight of the UAV. Physical constraints are derived 
from limited UAV-GCU communication range, line of sight requirements, possi- 
ble interference among the various UAV-GCU communication channels, and the 
limited field of view of a UAV’s sensor. Logistical constraints, such as fuel capac- 
ity and maintenance requirements, determine the endurance of the UAV. 

In this section we consider two types of missions: 

• Intelligence'. The UAV is sent out to investigate a certain object in 
a certain location. 

• Reconnaissance'. The UAV is on a patrol mission, searching for 
targets. 
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9.2.1 Intelligence - Investigating an Object 

The mission of the UAV is to fly to a specified destination, investigate a certain 
object such as a weapon system or an installation, and send back video images of 
that object. 

The Problem. The UAV flies over hostile territory and therefore may be subject 
to interception by the enemy’s air defense weapons. The object of interest may 
disappear or hide and therefore the mission is time sensitive. In view of these two 
operational aspects of the mission, we consider two MOEs: 

1 . Time, the time it takes the UAV to reach its destination. 

2. Survivability, the probability that the UAV reaches its destination 
unharmed (and thus it is able to execute its mission). 

The objectives are to minimize time and maximize survivability, and the problem 
is how to route the UAV from its base to its destination such that the two objectives 
are attained. A common and relatively simple way to route UAVs is by specifying a 
set of waypoints that the UAV must follow. These waypoints are entered into the UAV 
command unit, and the UAV, using navigation devices such as satellite-based Global 
Positioning System (GPS), moves from one waypoint to another en route to its desti- 
nation. The first waypoint is the base and the last one is the destination. The possible 
intermediate waypoints are determined by technical (e.g., line of sight) and operational 
(e.g., threat zones) considerations. The base, destination and the set of possible inter- 
mediate waypoints, along with the segments that connect them, form a graph, as 
shown in Figure 1. The graph in Figure 1 has 1 1 nodes - base, destination, and nine in- 
termediate waypoints - and 18 edges, which correspond to operationally feasible fly- 
ing segments. For example, the UAV can fly from node (waypoint) 2 to node 6, but 
not to node 4. The problem is to find the best set of intermediate waypoints for the 
UAV’s route, where best is defined by the two aforementioned objectives. In this and 
subsequent models we assume no restrictions regarding the GCU-UAV connection. 
The GCU of the UAV is located at the base node and it is within range and line of site 
with all the other nodes. Also, the UAV has no logistical constraints, e.g., the choice of 
route is not affected by fuel consumption. 




Figure 1: Graph of waypoints. 
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The Model. Suppose that the set of possible intermediate waypoints com- 
prises W waypoints (W^ 9 in Figure 1), and we label the base node by 0, and the 
destination node hy W + 1 (node 10 in Figure 1). The directed edge from node i to 
node j is denoted {ij). More generally, a route from i\ to h to ... to i„ is denoted 
For example, (2, 6, 9) is a route from node 2 to node 9 in Figure 1. 

Each edge {i,j) in the graph has two associated numbers - the flying time t^j on the 
corresponding segment, and the probability that the UAV is intercepted while 
flying on that segment. The parameter is determined by distance, wind condi- 
tions, and the nominal velocity of the UAV, and the probability p.j is determined 

by the locations and capabilities of hostile interception units. We assume spatial 
independence of the latter; the probability of being intercepted on one edge is in- 
dependent of the probability of being intercepted on another edge. This independ- 
ence assumption is reasonable in particular if an enemy air defense unit has lim- 
ited coverage and is effective with respect to only one edge. Thus, the probability 
that the UAV reaches its destination safely if it takes route is 



n- ~ P‘kh+\ ^ i® survivability MOE. 



The mission time MOE for that 



k=i 



n~\ 

route is > t, , - the total travel time. We assume that all the edges are feasible in 

the sense that for each edge (ij) p^ < 1 . 

Next we develop an integer linear optimization model (see Appendix B) that 
determines the optimal route - optimal set of intermediate waypoints - for the 
UAV. The model addresses the two MOEs, survivability and time, described 
above. 

The optimization problem is a variant of the well-known shortest path problem 
(see e.g., Ahuja et ah, 1993). 



Variables 



The variables in this problem are binary: 
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fl if the UAV flies over edge (;, j) 
[O otherwise 



Objective Function 

There are two possible ways to model the MOEs in the optimization problem: 
time-oriented model and survivability-oriented model. In the time-oriented model 
the objective is to minimize the travel time to the destination, while maintaining a 
minimum threshold for the survivability probability. In the time-oriented model 

w w+\ 

the objective is to minimize the total mission time, while satisfying 

i=0 j=l 

the minimum probability requirement. This double sum has the same value as the 
single sum used in defining the mission time MOE above, since the double sum 
includes the flying time for every edge actually flown, and no others. In the sur- 
vivability-oriented model the mission time is constrained and the objective is to 
maximize survivability, under that constraint. It follows that in the survivability 

w w+\ 

oriented model the objective is to maximize nn Pij) 'U The objective can 

1=0 y=i 

just as well be the logarithm of that quantity, since the logarithm is an increasing 
function. Since sums are analytically more convenient than products, we therefore 

w w+\ 

take the objective to be maximizing ^ ^ [ln(l - PiiWii ■ 

1=0 y=i 



Constraints 

Both time-oriented and survivability-oriented models share the same con- 
straints that determine feasible flying routes. 



W+l 

'^a^jX^j — 1 , 



j=i 



(9.1) 



W W+] 



=0’ ' = !’ 

/t=0 _/•=! 


-,w. 


(9.2) 


V..e{0,l}, 




(9.3) 



where 

fl if the UAV can fly over edge (;, j). 
[O Otherwise 
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The constraint in (9.1) indicates that at least one of the possible flying segments 
leaving the base ( edge (Oj/) such that a^j = 1 ) is actually flown. The constraints in 

(9.2) guarantee a continuous route; if one of the incoming edges into a node i is 
used by the UAV, then one of the outgoing edges from i must be used too. 

The additional constraints depend on the orientation of the model. In the time- 
oriented model the requirement is to maintain a minimum level a of survival 
probability. That is, 

w w+\ 

Y,Y^H\-PijWy>\na. (9.4) 

/=o y-1 

In the survivability-oriented model we set a time limit Tq on the duration of the 
travel: 

w w-^\ 

(9.5) 

/•=0 j =[ 

w w+\ 

Thus, the time-oriented model is Min subject to (9.1), (9.2), (9.3), 

1=0 /=1 

w w+i 

and (9.4), while the survivability-oriented model is Max ^^[ln(l-p,)]X, 

1=0 y=i 

subject to (9.1) - (9.3), (9.5). 

Example 1: A UAV is tasked to fly from its base to a given destination, using any 
of nine possible waypoints, as shown in Figure 1 . The flying time and interception 
probability on each edge are given in Table 1 . 

Suppose that the mission is time critical and the UAV must reach the destina- 
tion within 6 min. Solving the survivability-oriented problem, with To = 6 , we ob- 
tain the optimal route shown in Figure 2, with survivability probability of 0.32. 
However, if the mission is not time critical, but the required minimal mission- 
completion probability is 0.7, then we solve the time-oriented problem. Here the 
survivability probability is a constraint, completion time is the objective, and the 
optimal route is the one shown in Figure 3. 
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Figure 2: Optimal route, survivability-orieuted model, Tq= 6; flyiug time = 6 
miu; Pr[Survival] = 0.32. 



Edge 


Flying Time 


Interception Probability 


0,1 


2 


0.1 


0,2 


1 


0.2 


0,3 


3 


0.1 


1,4 


3 


0 


1,6 


2 


0.3 


2,6 


3 


0.3 


2,8 


4 


0.1 


3,5 


1 


0.2 


3,6 


2 


0.2 


3,9 


5 


0.1 


4,7 


3 


0.1 


5,8 


2 


0.2 


6,7 


2 


0.1 


6,9 


2 


0.1 


6,10 


2 


0.5 


7,10 


2 


0.1 


8,9 


3 


0.1 


9,10 


1 


0.3 



Table 1: Flyiug data. 

The completion time for this route is 10 min, and the survival probability is 
0.73, slightly more than the minimum threshold of 0.7. This survival probability 
(0.73) is also the highest possible over all routes. 
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Figure 3: Optimal route, time-orieuted model, ar = 0.7 ; flyiug time = 10 miu; 
Pr[Survival] = 0.73. 

Figure 4 presents the highest possible survivability probability for a given 
flying time, as obtained from solving a series of survivability-oriented problems. 
Notice the significant drop in this probability when the flying time changes from 
10 min to 9 min 




Flying Time 



Figure 4: Maximum possible survival probability for a giveu flyiug time. 



As was mentioned above, the maximum possible survivability probability, for 
any length of flying time, is 0.73. Flying time shorter than 6 min is infeasible. A 
chart of the type shown in Figure 4 can aid field commanders in deciding their 
best courses of action in routing a UAV for an intelligence mission. It demon- 
strates the trade-off between effectiveness (mission-completion time) and risk. 
Sheets “Intelligence-Time” and “Intelligence-Survivability” in Chapter9.xls solve 
the time-oriented and survivability-oriented problems, respectively. The models 
are solved using Excel’s Solver. 
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9.2.2 Reconnaissance - Detecting a Mobile Target 

Consider a team of Special Operations Forces (SOF) deployed in a remote deso- 
late border area attempting to capture a suspicious person (e.g., a terrorist), hence- 
forth called the target. According to intelligence reports, based on communication 
interception and human intelligence, the target is about to enter the area, attempt- 
ing to cross it and infdtrate into the populated zone, with the objective of carrying 
out hostile action. The team is supported by a small UAV, which is launched im- 
mediately upon receiving the intelligence report. The UAV flies over the area and 
transmits live video pictures of its sensor’s field of view to a ground control unit 
(GCU). The UAV is launched from the GCU. 

The Problem. The mission of the UAV is to detect the target before it leaves the 
border area and enters the populated zone. Because the area is essentially deserted, 
and the UAV has a very small signature, we assume that it is not subject to inter- 
ception. For the same reason we assume that the probability of a false-positive de- 
tection is negligible. In order to be able to transmit the video, the UAV must main- 
tain line of sight with its GCU continuously, and it must fly within a certain range 
from that unit. The endurance of the UAV is limited, but we assume that it is 
longer than the time it would take the target to cross the area and disappear inland. 
The intelligence information regarding the target includes his velocity and a set of 
feasible routes. While the target’s average velocity can be reasonably estimated 
based on his mode of travel (e.g., on foot, riding a horse, driving a vehicle), the 
specific route he may take is unknown. The objective of the SOF team is to plan a 
search pattern for the UAV that maximizes the probability of detecting the moving 
target. The target is assumed to be aware that he may be sought by a UAV and 
will choose his route to maximally frustrate the UAV team. 

Figure 5 presents an example of possible entry points into the border area, and 
routes that the target may take in it. 

There are three possible entry points into the border area - El, E2, and E3. 
Each entry point is associated with one or two possible routes the target may take. 
For example, if the target enters the area through El, then the two possible routes 
are (El,a,Dl) and (El,a,c,D2). Entry point E2 has two associated routes and E3 
has one route. Thus, there are altogether 2 -I- 2 -I- 1 = 5 possible routes in Figure 5. 
In addition to three entry points (El, E2, E3) and three departure points (Dl, D2, 
D3), there are four intermediate points - a,b,c,d. 

The routes form a graph in which nodes are the entry points, departure points 
and intermediate points (so there are 10 nodes in Figure 5), and the edges are route 
segments connecting two adjacent nodes. The small circles labeled 1^ are possi- 
ble locations for the GCU. Let r = 1,...,7? denote a route in the graph (7? = 5 in 
Figure 5) and i, ; = 1,. . denotes an edge (/ = 9 in Figure 5). Each edge belongs 
to one or more routes. Let I denote a feasible deployment location for the GCU, / 
= 1,. . .,T (T = 4 in Figure 5). 
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E2 




Figure 5: Entry points, routes and possible GCU locations in the border area. 

We assume that the velocity of the target on each edge is known. Therefore, 
once the information that the target has entered the border area is received, and if 
his route is known, the SOF team can project exactly his whereabouts at any point 
in time (see Equation 9.6 below). The problem is that the target’s route is un- 
known. We assume that time is discrete and that the SOF operation proceeds in 
time steps t = 0,1,..., Fq, where Fq is the time horizon of the operation (i.e., the 
maximum possible time the target spends in the border area). 



The Model. Let, 

^ [l ifthe target is on edge i at time step i, given route r 

“ jo Otherwise. ^ ^ 

The three-dimensional array {c'j } is input data for the mission plan, which is ob- 
tained from analyzing the terrain and the distances of the various possible routes. 

Consider now the UAV. We assume that in two consecutive time-steps the UAV 
can either stay in its current edge or move to an adjacent one. This assumption is not 
restrictive; the set of possible edges to which the UAV can move in one time step can 
be expanded if needed. For a given edge i, let J. denote the set of edges that contains 
edge i and all edges to which the UAV can fly in the next time period. According to 
our assumption, this is the set of edge i and all of its outgoing edges. For example, 

J,,,,,={{E2,b),{b,c),{b,d)}. 

Let Di denote the set of GCU locations such that a UAV launched from such a 
location can reach edge i at the first time step, and let A- denote the set of GCU lo- 
cations that can maintain continuous communication with a UAV flying over edge i. 

The probability that the UAV detects the target during a single time step, 
given the UAV and the target are present in edge i at the same time step, is . 

The probabilities q^, i = depend on the UAV sensor’s field of view, its reso- 
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lution, background clutter, etc. The detections between time steps are assumed to 
be independent. For example, if the UAV and the target were on edge i during 
7] time steps and on edge j during Tj time steps, then the probability of detection 

would be 1 - (1 - (1 - qjf' . 

Variables 

The variables in this problem are binary: 

if the UAV loiters over edge i during time-step t 
Otherwise 

if the GCU is deployed in location / 

Otherwise 

Objective Function 

A natural MOE for this problem is the probability that the target is detected. 
However, to maximize this MOE we need to know the probability distribution of 
the routes the target may take. These data may not be available, and if it is, the 
problem becomes a nonlinear optimization problem, as shown later on. Instead, 
we consider an MOE that hedges against a worse-case detection scenario. 

Given that the target has selected route r, the probability that he is not de- 

^0 I 

tected before leaving the search area is ]”[]”[ (1 - g, . The goal is to find a 

/=1 /=1 

UAV search, and GCU deployment plan such that this probability, under a worse- 
case scenario, is minimized. Thus, the objective function is 

Max ]”[]”[(! - 9 ,.)^"^". (9-7) 

’’ t=\ i=i 

This objective is quite conservative, since it in effect assumes that the infiltra- 
tor chooses his route to maximize his survival probability and does so in full 
knowledge of the UAV search plan. An alternative would be to treat this problem 
as a TPZS game (see Chapter 7) where the target’s route and the searcher’s search 
plan are each unknown when the other is selected. In terms of that game, we are 
calculating an upper bound on its value. As mentioned above, if there is some 
prior information about the likelihood of route r in the form of a probabil- 

R 

ity = 1 , then another alternative would be to minimize the overall prob- 

r=l 

ability of no detection, that is. 
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r=\ !=\ i=\ 


(9.8) 


While the nonlinear objective function in (9.7) can be transformed into a lin- 
ear one, the objective function in (9.8) cannot. Letting 7 be the logarithm of the 
objective function, (9.7) is equivalent to 


MinY 

^ it 




St 


(9.9) 


XXln(l-9,)<V,-F<0, r = \,...,R. 

t=\ Z=1 




Since ln(l - g, ) < 0 , we can write (9.9) in an equivalent form 


MaxW 

Xi, ,Zi 

St 

-X X ln(l - 9, KXu -W>0, r = l,...,R. 

/=1 Z = 1 


(9.10) 


W>0 




Constraints 




A, -X 2,^0, / = 


(9.11) 


1 

+ 

lA 

0 

II 

1 

II 


(9.12) 


i=\ 


(9.13) 


leA,- f=l 


(9.14) 


±Z,<1 


(9.15) 



/=i 



Constraints (9.11) and (9.12) guarantee a feasible search plan for the UAV. From 
constraints (9.11) it follows that the UAV can search an edge during the first time 
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step only if the edge is reachable on time from the launching location. Similarly to 
constraints (9.2) in Section 9.2.1 constraints (9.12) guarantee a feasible continuous 
search pattern. The UAV can search edge i during time step t, ( V,., = 1 ), only if it 
searches that edge, or one of its adjacent edges, at timet-1. Constraints (9.13) 
limit the presence of the UAV to at most one edge at any given time step. Con- 
straints (9.14) restrict the searched edges only to those which are within range and 
line of sight to the GCU. M is a large constant. Constraint (9.15) assures that the 
GCU is deployed in at most one location. 

Thus the optimization problem is (9.10) - (9.15), withV^,,Z, g {0,1} , which is 
a linear mixed-integer (0-1) programming problem. 

Example 2: Consider the scenario shown in Figure 5. Table 2 presents base case 
data. For each edge i the table presents the target’s travel time, the set of GCU lo- 
cations from which a UAV can reach edge i during the first time step, D,, and the 
GCU locations that can maintain continuous communication with a UAV flying 
over edge i, Aj. 



Edge 


(El, a) 


(a.Dl) 


(a,c) 


(E2,b) 


(b,c) 


(c,D2) 


(b,d) 


(E3,d) 


(d,D3) 


Travel 

time 


3 


4 


3 


2 


4 


4 


4 


5 


3 


A 


1,2,4 


1,2,4 


1, 2,3,4 


2,3,4 


1, 2,3,4 


1, 2,3,4 


2,3,4 


3,4 


1,3,4 


A, 


1, 2,3,4 


1,2.3, 4 


1, 2,3,4 


2,3,4 


1, 2,3,4 


1, 2,3,4 


1,2,3,4 


2,3,4 


1,2,3,4 



Table 2: Travel time and accessibility of edges - base case. 



It is easily verified that the time horizon is Tq= 10. We assume that the (in- 
stantaneous) probability of detection on each edge is the same, q = 0.7. Solving 
the optimization problem (9.10) - (9.15) we obtain the following UAV optimal 
search pattern for the base case (Table 3): 



Edge 


(El, a) 


(a,Dl) 


(a,c) 


(E2,b) 


(b,c) 


(c,D2) 


(b,d) 


(E3,d) 


(d.D3) 


Time 

step 


1,2 


- 


3 


- 


4,5 


- 


6 


- 


7,8,9,10 



Table 3: Optimal search pattern for tbe base case 



The GCU is deployed in location 1 . Once launched from location 2, the UAV 
searches edge (El, a) during the first two time steps. Then the UAV flies over (a,c) 
for one time period, (b,c) for two time periods, and (b,d) for one time period. The 
last four time steps are spent in (d,D3). The probability of detecting the target in a 
worst-case scenario is 0.91, which means that at each one of the five possible 
routes the UAV has at least two detection opportunities if the target selects that 
route. 

Suppose now that the topography at the border area is more rugged and the 
communication possibility between the GCU locations and the various edges is 
more restricted, as shown in Table 4. The new updated location for the GCU is 3. 
The new optimal search pattern is also shown in Table 4. The detection probability 
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in the worst-case scenario is now 0.7, which means that in at least one scenario 
(route) there is only one detection opportunity. 



Edge 


(El, a) 


(a,Dl) 


(a,c) 


(E2,b) 


(b,c) 


(c,D2) 


(b,d) 


(E3,d) 


(d,D3) 


Ai 


2,3 


1,2,3 


1, 2,3,4 


2,3,4 


1,3,4 


1,2,4 


1,2,3,4 


2,3,4 


1,2, 3, 4 


Time 

step 


2,3 


- 


1,4 


- 


5 


- 


6 


- 


7,8,9, 1 
0 



Table 4: Optimal search pattern for restricted flying zones. 



9.3 Unmanned Combat Aerial Vehicles 



An unmanned combat aerial vehicle (UCAV) is a self-propelled aerial vehicle that 
typically loiters over the target area, seeking targets. In addition to the functional- 
ity of a UAV, a UCAV is also a weapon, which can attack targets on the ground or 
at sea. A UCAV may be retrievable or disposable. Retrievable UCAVs are rela- 
tively large UAVs that carry one or more munitions such as bombs or missiles. 
They are launched toward the target area in a controlled trajectory, and upon arri- 
val in the target area, they start their search for targets to attack with the munitions 
they carry. Once their munitions are expended, the UCAVs return to their base for 
refit and reload. Disposable UCAVs are essentially precision-guided munitions, 
where the warhead is an integral part of the platform. Clearly, a disposable UCAV 
can attack at most one target. In this section we describe two UCAV models: (1) 
disposable vehicle and (2) retrievable multiple-weapon vehicle. 

In the following models we assume that there are two types of targets in the 
target area: valuable targets (VT) and non-valuable targets (NVT). The definition 
of a VT is derived from the scenario and the mission objectives. For example, in 
some situations armored vehicles would be considered as VTs, while in other 
situations surface-to-air missile batteries would be VTs. All other objects in the 
target area that may be considered by the UCAV as targets are NVTs. In particu- 
lar, a killed VT becomes NVT. The situation we consider is where a UCAV loiters 
over the target area searching for VTs. Once the UCAV detects an object, it inves- 
tigates the object and determines if it is a VT or an NVT. According to the classi- 
fication of the target, the UCAV engages the object or abandons it and continues 
with the search. If the UCAV engages an object, it attacks it and kills it with a cer- 
tain probability. The sensor of the UCAV is imperfect. It may erroneously identify 
a VT as an NVT (false-negative error) and an NVT as a VT (false-positive error). 



9.3.1 Disposable UCAV 



A single disposable UCAV is launched to attack valuable targets (VTs) in a cer- 
tain area. The area contains also non-valuable targets (NVTs). We initially assume 
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that the UCAV is memoryless; it does not keep track of earlier non-engagement 
decisions and therefore may revisit and reinvestigate previously detected objects. 

The Problem. We wish to compute the probability that the UCAV successfully at- 
tacks a VT within a certain given time window. The inputs of the problem are (1) 
number of VTs and NVTs in the target area, (2) UCAV’s mean inter-detection 
time - the time it takes the UCAV to move from one target to another, (3) 
UCAV’s kill probability, and (4) false-negative and false-positive detection 
probabilities. 

The Model. There are T targets in the target area; out of which K targets are VTs 
and T - K are NVTs. The UCAV loiters over the target area and detects targets at 
a rate A. Following detection, the UCAV instantly identifies the object and de- 
cides whether to engage it or abandon it and move on to the next target. The inter- 
detection time is exponentially distributed with meanl/A , which implies that the 
number of detections in a certain time interval At is a Poisson random variable 
with parameter 2.At . The probability of correct identification by the UCAV is q 
and r for a VT and NVT, respectively. The parameter q represents the sensitivity 
of the UCAV’s sensor (1 - ^ is the false-negative probability), while r represents 
its specificity (1 - r is the false -positive probability). In other words, if the UCAV 
detects a VT, it engages it with probability q, and if it detects an NVT, it engages 
it (erroneously) with probability 1 - r. Recall that the UCAV is memoryless; it 
may revisit previously detected (and identified as NVT) objects, and possibly en- 
gage them. The kill probability of a VT, given the target has been engaged, is p. 
For a mission time window of duration T , we compute the probability that the 
UCAV kills a VT during that time window. 

Three possible outcomes may occur following a detection event: 












Correct engagement of a VT, with probability: = —q; 

T — K 

Erroneous engagement of an NVT, with probability: = (1 ~ ^); 



No engagement, with probability: (p^ 



T 



( 1 - 9 ) + 



T-K 

T 



r = 



l-cp^-cp^. 



Note that these probabilities depend on the ratio between the number of VTs 
and NVTs and not on their absolute values. 

Given n detection opportunities during the time window, the probability that 
the UCAV engages a VT is 



/=i 



i-< 

1-^3 



(Pv 



(9.16) 
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The ith tenii in the sum in (9.16) is the probability that the ith detection will 
result in correctly engaging a VT, which requires that the first i—\ detections result 
in no engagement. The second equality in (9.16) is because the sum is a geometric 
series. When « — ^ c<> (i.e., when there is no time constraint for the mission and the 

UCAV has unlimited endurance) then P IFTl ^ ^ — = — , where 

1-^3 \ + pq-r 

p = is the ratio between the number of VTs and NVTs in the target area. 

T-K 

Recall that the number of detections follows a Poisson distribution, therefore, 
given a time window T, absent engagement, the probability of n detections is 

Pr[// = „] = — , (9.17) 

n\ 



From (9.16) and (9.17) we obtain that the (unconditional) probability that the 
UCAV engages a VT is 



P,(FT) = £ 



\-(pl {XtYe- 



=0 l-t^j 

pq 

\ + pq-r 



■^1 






n ! 

-At{ 1 - 



1-^3 






l-e 



V, 

\+p \+p 



(9.18) 



and the probability a VT is killed is pP^(VT) . From (9.18) we can also observe 
that the time until an object is engaged is exponentially distributed random vari- 
able with mean 1/ (/l(l . 

Example 3: The ratio between the number of VTs and NVTs in the target area 
is /? = 1 . The detection rate is A = 1 per minute, and the probabilities are ^ = 0.7, r 
= 0.8, and p = 0.5. The time window is r = 10 min. The kill probability is 
/jPjd (ft) = 0.5x0.77 = 0.385 . (See sheet “Disposable Memoryless” in Chap- 

ter9.xls). Figure 6 presents the effect of the UCAV’s sensitivity {q) and specificity 
(r) on the probability of engaging a VT. 
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Figure 6: The probability of engaging a VT as a function of r and q. 



Notice that the effect of the two error probabilities is not symmetrical. When r 
= 0.5 and q ^ I, P\q{VT) = 0.67, while in the opposite direction, r = 1 and q = 0.5, 
we obtain Pio(F7) = 0.92. From Figure 6 we can conclude that specificity has a 
larger effect than sensitivity on the performance of the UCAV. This comes with 
no surprise since false-negative error can be rectified later on in the mission, while 
false-positive error cannot; engaging the wrong target (NVT) by the UCAV results 
in a mission failure. 

Figure 7 shows the effect of the target ratio p on the VT engagement prob- 
ability, for three values of detection rate X . 



P(VT) 




P 



---A=.5 
A=1 



Figure 7: The probability of engaging a VT as a Function of p and X . 



We can see from Figure 7 that detection rate higher than 1 does not affect the 
VT engagement probability; this probability converges very rapidly to — . 

1-^3 

Suppose now that the UCAV has perfect memory and it would never revisit a 
target that has been detected before and has been identified as NVT. The UCAV 
selects a target at random from the set of targets not detected yet. The maximum 
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possible number of detections in this case is T, and we assume that the time win- 
dow for the mission is such that this maximum can be attained. Note that the T de- 
tections may result in no engagement, if all K VTs are misclassified by the UCAV 
as NVTs, and all T- NVTs are identified correctly. In that event, which occurs 
with probability (1 - , we assume that the UCAV selects at random a tar- 

get and engages it. Unlike the memoryless case, where the probability of engaging 
a VT only depends on the ratio between the number of VTs and NVTs, here this 
probability depends on the absolute numbers of VTs and NVTs, that is, 
P{VT) = P{K,T) . Let Q{K,T) be the probability of a successful engagement be- 
fore the search is over. This probability is obtained recursively: 

Q(K, T) = j(q + (l- q)Q(K - 1, T - 1)) + rQ(K, T -\). (9.1 9) 

In (9.19), the boundary conditions are that g(0,t) = 0, t = 0,...,Tand Q(l,l) = q. 

The first term on the right-hand side of (9.19) represents the situation where a VT 
is detected but is erroneously identified as an NVT. The second term represents 
the situation where an NVT is detected and is correctly identified as such. In both 
cases the search for a VT continues. The probability P{K,T) of engaging a VT is 
given now by. 



P(K, T) = Q{K, T) + j{\- qf . (9.20) 

Example 4: There are 7= 10 targets in the target area, out of which K = 5 are 
VTs. The sensitivity and specificity probabilities are q = r = 0.75 . The probability 

of engaging a VT is 0.769 (See sheet “Disposable Full Memory” in Chapter9.xls). 
From (9.16) it follows that the maximum possible engagement probability (when 
« — > CO ) for the memoryless UCAV is 0.75. Figure 8 presents the values ofP{VT) 
for the situation where the number of VTs is equal to the number of NVTs, and 
the total number of objects range from 2 to 20. Clearly, the full memory UCAV 
outperforms the memoryless one but asymptotically approaches its value -0.75. 
Notice also that the full memory UCAV performs best in this case when the num- 
ber of objects is 4. 
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Full Memory 

Memoryless 



Figure 8: The probability of engaging a VT for the two types of UCAVs, 
q = r = 0.75. 



Figures 9 and 10 depict these probabilities for the situations q = l, r = 0.5, 
and q = 0.5, r = 1 , respectively. 

We observe that the situation of perfect specificity (r = 1) and poor sensitivity 
{q = 0.5) totally dominates the opposite situation of poor specificity (r = 0.5) and 
perfect sensitivity {q = 1). This observation leads to the conclusion that it is much 
more effective to invest in reducing the false-positive error than investing in re- 
ducing the false-negative error. Moreover, while in the first case (Figure 9), the 
full memory UCAV dominates the memoryless one, as in the base case above, the 
reverse is true in the second case (Figure 10) when the specificity is perfect. This 
is no surprise; if the UCAV can always detect an NVT, eventually it will acquire a 
VT. 




Full Memory 
Memoryless 



Figure 9: The probability of eugagiug a VT for the two types of UCAVs, 
^ = 1, r = 0.5. 
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Figure 10: The probability of engaging a VT for the two types of UCAVs, 
q = 0.5, r = 1. 



9.3.2 Retrievable Multiple-Weapon UCAV 

A retrievable UCAV carrying a number of munitions loiters over the target area in 
search of VTs. Once the UCAV detects a target and identifies it as a VT, the 
UCAV engages it with one of its munitions. The UCAV is memoryless and de- 
tects the targets in a random (uniform) manner. 

The Problem. Given a certain number of VTs and NVTs in the target area, and a 
number of munitions on-board the UCAV, the objective is to compute the prob- 
ability distribution function of the number of killed VTs, and the expected dura- 
tion of the operation. Assuming a long endurance of the UCAV relative to the du- 
ration of a typical mission, the operation ends when the last munition is delivered. 

The Model. Let T denote the total number of objects (VTs and NVTs) in the tar- 
get area, and let K denote the initial number of VTs. Thus, T-K is the initial 
number of NVTs. The UCAV carries N munitions at the beginning of the opera- 
tion, and the probability of correct identification by the UCAV is q and r for a VT 
and NVT, respectively. The mean inter-detection time - the time between two 
consecutive detections - is Ed. The mean attack time - the time from detection un- 
til the munition is delivered - is E^. The kill probability is p. 

We develop now a model for obtaining the probability distribution of the 
number of killed VTs (in particular, its expected value) and the expected duration 
of the operation. Suppose that at a certain stage in the engagement there 
arek,k ^ K , VTs, and T-k NVTs. The probability that the UCAV engages a VT 

k T-k 

given detection is—q, and the probability of engaging an NVT is — — — (1 - r) . 

Next define a Markov chain (see Appendix A) that describes the UCAV’s op- 
eration. A step in this chain is a detection event, and a state is a pair (n,k), where n 
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is the number of munitions left on-board the UCAV and k is the number of VTs 
left in the target area (recall that a killed VT is NVT). There are three possible 
transitions following a detection event. 



(n,k) ^ (n - 1, k - 1) with probability — qp, 



(9.21) 



The UCAV killed a VT 



k T-k 

(/7, k)^ {n- 1, k) with probability — q{\ - p) + (1 - r), 

The UCAV acquired 
a VT but missed it, 
or acquired an NVT 

k T-k 

{n, k) (/?, k) with probability — (1 - ^) H r . 

The UCAV did not ^ ^ 

acquire the target 



(9.22) 



(9.23) 



The initial state is {N, K) and the absorbing states (see Appendix A) are (0, k), 
k = Max{Q,K-N},...,K . Note that if the UCAV has perfect sensitivity and speci- 
ficity, that is, q = r = \, N>K, and the number K of VTs is known to the UCAV, 
then the states [n,Q),n = \,...,N - K are absorbing too since the UCAV will 
leave the target area once all the VTs are killed. The possible states are 

{N,K) 

{N-\,KUN-\,K-\) 

(N -2,K),{N -2,K -2,K -1). 



{Q,K),{0,K-\), ... ,(Q,Max{Q,K-N]), 



and the number of states is 
'K + 2 



S=< 



(K + l) 

"N + 2 
. 2 



+ N-K , ifN>K 



ifN<K 



(9.25) 



From now on we assume that the number of munitions loaded on the UCAV is 
not larger than the initial number of VTs in the target area, that is, N < K .In that 
case, the number of absorbing states is V+ 1 (see the last row of (9.24)), and the 

rv-rl) 



number of transient states (see Appendix A) is 



V 



2 



y 
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The Markov transition matrix M can be written as 



M = 




0 

Q\ 



(9.26) 



where is a (A'^ + 1) x (A'^ + 1) unit matrix that corresponds to the absorbing 
states, 2 is a iri^trix corresponding to the transient states, and is a 

|^^^']x(Af+l) matrix that represent transitions from a transient state to an absorb- 



ing one. The probability distribution of the number of VTs killed is given in the 



first row of (I -Q) ’ , where / is a 



(A + l) 


'N + l'' 


X 

1 2 J 


, 2 , 



unit matrix, see Appendix A. 



Note that the first row of (I — Q) ' R corresponds to the initial state of the mission 
before the first munition is delivered. The expected number of visits to a transient 
state (n,k), is the (n,A:)-th entry in the first row of (I-Q) ' (see Appendix 

A), therefore, the expected duration of the mission is 



E[Time] = E^ ^ -I- NE^ . 

{n,k)transient 



(9.27) 



Example 5: A UCAV carrying N=3 munitions is sent out to attack a target area, 
which comprises T = 12 targets, out of which K = 6 are VTs. The probability of 
recognizing a VT is ^ = 0.7, and the probability of recognizing an NVT is r = 0.8. 
The kill probability is = 0.5. The mean inter-detection time is = 3 min, and 
the mean attack time is Ea = 0.5 min. According to (9.25) there are 10 states, out 
of which 4 states are absorbing. The transition matrix (see sheet “Retrievable” in 
Chapter9.xls) is 



States 


(0,6) 


(0,5) 


(0,4) 


(0,3) 


(3,6) 


(2,6) 


(2,5) 


(1,6) 


(1,5) 


(1,4) 


(0,6) 


1 


0 


0 


0 


0 


0 


0 


0 


0 


0 


(0,5) 


0 


1 


0 


0 


0 


0 


0 


0 


0 


0 


(0,4) 


0 


0 


1 


0 


0 


0 


0 


0 


0 


0 


(0,3) 


0 


0 


0 


1 


0 


0 


0 


0 


0 


0 


(3,6) 


0 


0 


0 


0 


0.55 


0.28 


0.18 


0 


0 


0 


(2,6) 


0 


0 


0 


0 


0 


0.55 


0 


0.28 


0.18 


0 


(2,5) 


0 


0 


0 


0 


0 


0 


0.59 


0 


0.26 


0.15 


(1,6) 


0.28 


0.18 


0 


0 


0 


0 


0 


0.55 


0 


0 


(1,5) 


0 


0.26 


0.15 


0 


0 


0 


0 


0 


0.59 


0 


(1,4) 


0 


0 


0.25 


0.12 


0 


0 


0 


0 


0 


0.63 
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The highlighted area in M is 


the sub matrix Q. 


The matrix / - 


0.45 


-0.3 


-0.2 


0 


0 


0 


0 


0.45 


0 


-0.3 


-0.18 


0 


0 


0 


0.41 


0 


-0.26 


-0.15 


0 


0 


0 


0.45 


0 


0 


0 


0 


0 


0 


0.41 


0 


0 


0 


0 


0 


0 


0.37 



and its inverse (I-Q) ’ is 



2.22 


1.36 


0.95 


0.83 


1.19 


0.38 


0 


2.22 


0 


1.36 


0.95 


0 


0 


0 


2.45 


0 


1.57 


0.97 


0 


0 


0 


2.22 


0 


0 


0 


0 


0 


0 


2.45 


0 


0 


0 


0 


0 


0 


2.73 



Using (9.27) we obtain that the expeeted duration of the operation is 
£[riwe] = 3x(2.22 + 1.36 + 0.95 + 0.83 + 1.19 + 0.38) + 3x0.5 = 22.3 min 
The probability distribution is obtained from the first row of (I - (9) ’ R : 



States (0,6) 


(0,5) 


(0,4) 


(0,3) 


0.23 


0.46 


0.27 


0.04 


0.37 


0.49 


0.14 


0.00 


0.00 


0.41 


0.47 


0.11 


0.61 


0.39 


0.00 


0.00 


0.00 


0.64 


0.36 


0.00 


0.00 


0.00 


0.68 


0.32 



The probability that eaeh one of the three UCAVs kills a VT (state (0,3)) is 
0.04, while the probability that none is sueeessful (state (0,6)) is 0.23. 
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9.4 Summary, Extensions and Further Reading 



In this chapter we presented two types of UAV models: prescriptive models for 
UAV routing in Section 9.2 and descriptive models for autonomous UCAV de- 
ployment in Section 9.3. These simple and basic models exhibit major factors that 
dominate UAV operations - time, survivability, detection capabilities, and mem- 
ory - and highlight the main features of UAV modeling - routing, scheduling, and 
effectiveness assessment. These models can be expanded in several ways, de- 
scribed briefly below. 



9.4.1 Optimizing the Employment ofUAVs 

While the models in Section 9.2 reflect realistically the objective functions of 
UAV operations such as time to complete a mission, probability of mission com- 
pletion and the probability of detecting a target, they ignore many constraints that 
are typical in such settings. UAVs may have limited endurance due to logistical 
constraints, therefore refit and refuel actions must also be represented in an opti- 
mization model. There may also be physical constraints, which are associated with 
the specific flying capabilities of the UAV, not just the topography, that will limit 
its maneuverability and thus constrain its feasible routes. Also, the models pre- 
sented are two dimensional in the sense that only the projection on the ground of 
the location of the UAV - the ( i, j ) coordinate - is taken into account. In reality 

the altitude of the UAV affects its capability to accomplish its mission and there- 
fore must also be taken into account. 

Moreover, the models in Section 9.2 optimize the employment of only one 
UAV. In many combat situations the ground force may operate more than one. 
The challenge in such situations is to efficiently coordinate the route and schedule 
of each UAV such that the objective function is optimized. In such multiple-UAV 
situations additional constraints relate to flying safety - deconfliction among the 
flying patterns of the various UAVs, control - assigning UAVs to ground control 
centers and, most of all, managing and fusing effectively the information obtained 
from the UAVs’ sensors. Kress and Royset (2008) present a model that addresses 
some of the extensions mentioned above in a multiple-UAV setting. Alighanbari 
and How (2008) present a robust approach to task assignment of UAVs, and Ras- 
mussen and Shima (2008) develop a tree search algorithm for assigning cooperat- 
ing UAVs to multiple tasks. Optimal deployment and employment of UAVs for 
search missions is a special case of search problems addressed in Chapter 7. Dell 
et al. (1996) study multiple searches in constrained path moving-target settings 
that might be applied to UAVs. 
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9.4.2 Unmanned Combat Aerial Vehicles 

In Section 9.3 we presented two descriptive models for a single UCAV. In reality 
UCAVs, in particular disposable UCAVs, are deployed in swarms that loiter over 
the target area. Similarly to multiple search UAVs, a key question is how to coor- 
dinate the UCAVs and what is the effect of such coordination on the outcome of 
the mission. A simple extension of the model in Section 9.3.1 is where a swarm of 
disposable UCAVs attack a cluster of targets (e.g., armored vehicles in a battal- 
ion). Similarly to the Markov model in Section 9.3.2, a state of the battle would be 
represented by the pair (n,k) where n is the number of UCAVs and k is the num- 
ber of VTs. However, unlike the single retrievable UCAV with multiple munitions 
in Section 9.3.2, absent coordination, two or more disposable UCAVs may acquire 
simultaneously and independently the same VT and thus waste valuable attack re- 
sources. This situation leads to a more complex Markov model that is described in 
Kress et al. (2006b). 

The models described in Section 9.3.2 and in the paper mentioned above are 
limited to homogeneous targets and homogeneous UCAVs. At significant compu- 
tational cost, these models may be extended to account for non-homogeneous tar- 
gets and multiple types of UCAVs. Another interesting and potentially important 
extension is to incorporate decision rules where the UCAVs manifest some level 
of cognitive capability. Specifically, in reality both sensitivity and specificity 
probabilities may depend on the time a UCAV spends investigating a target. This 
aspect is not captured in current models and may lead to interesting optimization 
models. 
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Exercises 

(1) Consider Example 1 and suppose that due to operational or safety considera- 
tions the UAV cannot fly over edge (0,1). Use sheet “Intelligence- 
Survivability” of Chapter9.xls to answer the following questions {Hint as- 
sign a very large value). 

(a) Can the mission be completed within 8 min with survivability probability 
not less than 0.7? Ans. No. 

(b) If the mission is to be completed within 7 min, what is the optimal route 
and what is the probability the UAV reaches its destination unharmed? Ans. 
(0,2),(2,6),(6,9),(9, 10); 0.353. 

(c) What is the fastest route if the required probability for mission comple- 
tion is 0.6? Ans. No route is feasible. 

(2) Consider Example 1. Based on additional intelligence information, it is de- 
cided that the route of the UAV must pass through waypoint 8. Write an ad- 
ditional constraint for (9.1) - (9.3) that reflects this requirement. 

Ans. Require that = 1 . The optimal solution should have the UAV 

flying from waypoint 8 to waypoint 9 (Xg^ = 1 ). 

(3) The military is considering adopting one of two possible disposable UCAVs: 
UCAVl, which is memoryless, with limited endurance but with high speci- 
ficity, and UCAV2 which has unlimited endurance, full memory but with 
lower specificity. Specifically, UCAVl ’s detection rate isZ = l, its endur- 
ance is r = 15 min and its specificity is r = 0.95 . The specificity of UCAV2 
is r = 0.7 . The sensitivity and kill probability are the same for both UCAVs, 
q = 0.7 and/? = 0.5, respectively. All other parameters (e.g., cost, mainte- 
nance, interoperability) are assumed to be equal. The reference scenario is a 
target area with 10 targets out of which 5 are VTs. Which UCAV is more ef- 
fective? Use the two “Disposable” sheets of Chapter9.xls. 

Ans. UCAVl. (UCAVl) = 0.465, P^(UCAV2) = 0.359 . 

(4) Consider Example 5 with T = 30 targets and K = \1 VTs. (a) compute the 
expected number of killed VTs, (b) plot a graph of the expected number of 
killed VTs as a function of the number of VTs in the target area. 

Ans. (a) 1.22. 




Chapter 10 

Terror and Insurgency 



Fighting terrorism is like 
being a goalkeeper. You 
can make a hundred bril- 
liant saves but the only shot 
that people remember is the 
one that gets past you. 

Paul Wilkinson 



10.1 Introduction 



So far we have studied “pure” military situations in the sense that all actors are es- 
tablished state military entities such as batteries of surface-to-air missile launch- 
ers, infantry battalions, air force squadrons, swarms of UAVs or navy ships. The 
military engagements, combat situations, and conflicts described in the previous 
chapters were typically between such entities. In this chapter we broaden the 
scope and consider non-state violent actors such as terrorist groups, guerillas, and 
insurgents, collectively called in this section - rebellions. 

Terrorist acts against the civilian population and insurgency activities against 
the state are committed by stateless individuals or organizations. Since the 
inhabitable parts of the Earth are partitioned into states, rebellions must 
necessarily reside in some “hosf’ state. If the host state is itself the rebellion’s 
target, then these stateless actors must be careful to disguise themselves from the 
state’s police or armed forces and to rely on the population for support. It follows 
that rebellions’ weapons must be relatively simple and small - they must maintain 
a small signature in order to thrive and operate. Rebellions do not maintain 
inventories of aircraft or tanks or ships, like many states do, but rely instead on 
small arms, improvised explosive devices, biological agents, poisons, and other 
methods of causing mayhem that do not require persistent exposure. 

Countering rebellions by a regular state force is sometimes called asymmetric 
or irregular warfare. The asymmetry is manifested in three main dimensions: 
force size, military capabilities, and intelligence. In terms of physical net assess- 
ment, the capabilities of the rebels are no match to those of the state forces; the 
number of combatants available to the regime forces is typically at least an order 
of magnitude larger than the number of active rebels (as of July 2007 there were 
more than 500,000 Coalition and Iraqi Security Forces operating in Iraq, while the 
estimate for the number of insurgents ranged between 15,000 and 70,000, 
O’Hanlon and Campbell, 2007). The state forces are usually better equipped and 
trained, and they operate more advanced and effective weapon systems than the 
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rebellions. The only advantage of the rebellions is their elusiveness and invisibil- 
ity; the regime forces have the military means and capabilities to effectively en- 
gage the targets, but they simply cannot find them. In other words, the regime 
forces lack situational awareness. The rebels, on the other hand, have many ex- 
posed targets to choose from and their situational awareness regarding their civil- 
ian and military targets is almost perfect. 

In this chapter we address three typical problems associated with terrorism and 
insurgency. Section 10.2 presents a model that estimates the effect of a suicide 
bomber in a crowded area. In particular, the effect of crowd blocking is modeled 
and evaluated. The second problem concerns biological terrorism in the form of 
intentional spread of contagious agents. Section 10.3 discusses this problem in the 
context of smallpox and presents an epidemiological model that is based on the 
classical susceptible-infected-recovered (SIR) model (Anderson and May, 1991). 
The epidemiological model describes a mass vaccination process and evaluates the 
number of casualties, as well as the required isolation capacity, as a function of the 
vaccination capacity and other parameters. Finally, in Section 10.4 we present a 
simple dynamic, Lanchester-based, model that describes the evolution of an insur- 
gency and of the attitude of the population in the presence of the regime’s counter- 
insurgency operations. 



10.2 The Effect of Suicide Bombing 



Terror events that involve suicide bombers (SB) are a major concern to regimes - 
mostly because of their psychological effect on the population, which may be am- 
plified by media coverage. Typical suicide bomb events occur in relatively small, 
crowded areas, henceforth called arenas, such as restaurants, buses, and bus sta- 
tions. AnSB enters an arena and attempts to detonate the bomb he carries so as to 
maximize the number of casualties. In this section we will construct a model that 
is intended to explore the effect of such a bomb as a function of various relevant 
parameters, especially the crowd density in the arena. 

One would expect that the number of casualties would increase with the density 
of the crowd, although at a decreasing rate, and would decrease with the size of 
the arena for a fixed number of people. The reason for the first assertion is that a 
higher density of people increases the probability that a random fragment of the 
bomb hits a person, and therefore it also increases the expected number of casual- 
ties. The second assertion seems reasonable for a similar reason; for a fixed crowd 
size, a larger arena implies lower density. 

We will show that this is not necessarily the case. Crowd blocking has a sig- 
nificant effect on the expected number of casualties. Crowd blocking occurs when 
some persons are shielded from the fragments of the bomb by other persons who 
stand between them and the SB. The SB model presented here is reported in Kress 
(2005a). An extension of the model that examines the effectiveness of SB mitiga- 
tion strategies is described in Kaplan and Kress (2005). 
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10.2.1 Dispersion of Fragments 

A typical SB carries an explosive charge (EC) mounted on a belt, which is con- 
cealed under a coat or a large shirt. The EC contains explosive and small pieces of 
metal, such as screws and nails. The SB attempts to position himself as close to 
the center of the crowd as possible, and then to set off the bomb. The metal frag- 
ments in the bomb disperse in a beam spray that hit people in the vicinity of the 
SB. Due to the relatively small amount of explosive (3^ kg) we assume that the 
energy of the fragments is such that a fragment may injure an exposed person or, 
with a lower probability, a person standing right behind an exposed person. The 
number of effective fragments - those which are potentially effective - is roughly 
one half the number of fragments on the belt. The other half is wasted on the SB 
himself. We assume that the explosive and fragments are uniformly distributed on 
the belt, which is depicted as a full circle; see Eigure 1 . 




Figure 1: Suicide belt aud its spray beam. 

Let 

N - Number of effective fragments in the beam spray 
R - Range from the belt of the SB 

b - Diameter of the suicide belt (the average “diameter” of a person's body) 

P - Dispersion angle of the effective beam spray 

PfR) - Probability that an exposed person at range R is hit 

First we compute the density of fragments at a range R from the SB. 



As shown in Figure 1, the beam spray is distributed vertically with angle a be- 
tween and +-y from the horizontal at the point of explosion. This is the cen- 
ter of the sphere in Figure 2. The arc length at range R \^ds = Rda. The surface 
area of a complete circumferential strip at angle a is R cos aRd a = 

2kR^ cosada. The total area of dispersion at range R is therefore 
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Figure 2: The dispersion area. 



(10.1) 



It follows that the density of fragments at range R is 



(^R 



N 



sin 



I' 

2 



(10.2) 



A person is represented in the model as a cylinder of diameter b and height cb, 
c > 1 . The cylinder’ s exposed area to the fragments A is approximately a rectangle 
of width b and height that is determined by the dispersion angle fi (see Figure 
3).That is, 



A = bMin{2R tan(y5/2), cb} . 



cb 




Figure 3: The exposed area. 



(10.3) 



Assuming uniform and independent dispersion of fragments, the probability 
that a person at range R is hit by at least one fragment is 
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Ph{R) = \- 1 



A 



(10.4) 



V 




From (10.2) it follows that 



1-T7— 




(10.5) 



We assume that the effectiveness of the explosive and the size of the arena are 
such that air resistance and gravitation have no significant effect on the trajectory 
and energy of the fragments. A fragment does not become harmless with distance. 



10.2.2 Modeling the Arena 

Some typical SB arenas, such as restaurants, may be depicted as a circular area 
around the SB. Let denote the distance between the SB (the EC belt) and the 
boundary of the arena. We assume that the crowd is distributed randomly and uni- 
formly in the arena. 

For modeling purposes, it is convenient to view the circular arena as a sequence 
of M concentric rings of width b. Each person occupies a round “slot” of diameter 
in a certain ring. In particular, the SB is located in the central slot (see Figure 
4a). 



Figure 4: The arena. 

The number of circles that can fit into the mth ring is determined by the angle 
between the line that connects the point of explosion with the center of a circle 
in ring m, and the tangent to that circle that passes through the point of explosion 




A 



B 
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(see Figure 4b). That is, 
= arcsin(l/2m) and 



In 

' 



But, sin 



bjl _ \ 
mb 2m 



therefore 



_ ^ 
arcsin(l/2m) ’ 



m = 



( 10 . 6 ) 



which is obviously independent of b. From now on we take b =\. 

It can be verified (at least for M < 200 , which is much more rings than we 
need for the SB scenario) that is integer only for m = \ {a^ = 6). Assume that in 
each ring we pack, with possible small overlaps, k^=\ a„, ] slots - A:^ is 
rounded up. Define the overlap factor of ring m by 

k —a 

(10.7) 

km 



For example, the overlap factors for m = 1,2, 3, 4, 5, 6 are 0, 0.04, 0.01, 0.04, 
0.02, 0.01, respectively. For higher values of M the gets even smaller since the 
numerator in (6) is bounded by 1 and the denominator is (strictly) monotone in- 
creasing in m. See Figure 5 for m = 1,2, and note the slight overlap of the circles 
in the second ring. 




Figure 5: Number of slots, k^ =6, k^= 13. 

Thus, the overlaps are marginal and they can be removed by slightly increasing 
the width of a ring. From now on we assume that the mth ring in the arena con- 
tains slots. 

Let, 



m 

K{m)='Zk„, 



m = 



( 10 . 8 ) 
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K{M) is the maximum possible number of people in the arena (excluding the 



SB). For example, if M = 10, then K{M) = 6+13+19+26+32+38+44+51+57+63 = 



349. Since we assume random homogeneous mixing in the arena, the expected 
number of people in the with ring is 







(10.9) 



where L is the number of people in the arena. 



10.2.3 The Effect of Crowd Blocking 

The number of casualties depends on the spatial distribution of people in the 
arena. Some people may become human shields to others by blocking the frag- 
ments. As shown in Figure 6, B1 totally shields (blocks) the target (T), and B2 
partially shields it. 




Figure 6: Partial and complete blocking. 



A target (person) is assumed to be protected against a fragment that is moving 
its way if and only if the target’s center is shielded from the beam spray. In other 
words, the target is safe if and only if there exists at least one person that blocks 
the line of sight (LOS) from the center of the SB’s point of explosion to the center 
of the target. B1 in Figure 6 protects the target, while B2 does not. Instead of de- 
scribing the situation in terms of persons, we will do it by looking at the slots in 
the various rings. Thus, a person (T) in ring m is protected if and only if there is at 
least one occupied slot in rings 1,. . ., m—l that intersects the line of sight SB - T. 

Since the slots in each ring are packed, there exists in each ring 1,..., m-1 ex- 
actly one slot that intersects the LOS SB - T. It follows that T is safe if and only if 
at least one of these m—l slots is occupied by one of the other L— 1 persons in the 
arena. Denote the probability of the complement of this event - the probability 
that a target in ring m is exposed and vulnerable to the explosion, given L people 
in the arena - by 7i{L, m) . Consider a certain person in ring m. Because of the uni- 
form and independent distribution of people in the arena, the probability that the 
first of the other L— 1 persons will not occupy a blocking slot is that he chooses one 
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of K{M) - m empty slots out of the K{M) - 1 available. The second person can 
choose from K(M) -m-\ empty slots out of K{M) - 2 available, and so on. Thus, 



7r(L, m) = \ 



nf > 



/=i V 



m-\ I 
K(M)-l ) 



[0 



if L<K{M)-m + 2 
otherwise 



(10.10) 



and it is easily seen that 1 = 7t{L,V) > 7r{L,2) k{L,M) . 

Casualties that get directly hit by a fragment are called primary casualties. The 
expected number of primary casualties in the mth ring is 



where from (10.5), 



P^m)= 



NMin {2m tan{^/ 2), c} 
4;rsin— 

1-e ^ 



The expected total number of primary casualties is 



M 



m=\ 



( 10 . 11 ) 



( 10 . 12 ) 



(10.13) 



Secondary casualties are persons located right behind exposed and hit persons. 
In terms of our model, person B (slot B) is susceptible to a secondary hit if there is 
exactly one slot A that blocks the LOS between the SB and B, and slot B is tan- 
gent to A. In other words, a person in ring m+1 might sustain a secondary hit if 
there is a person in ring m that stands between him and the SB. We assume that a 
secondary injury occurs independently, with probability q, provided the immediate 
shield has been hit. The situation where a secondary hit is possible in ring m hap- 
pens when the slots in rings m-\ and m on the LOS are occupied, and slots 
l,...,w-2on the LOS are not. The event A = {Slots I,..., m-2 along the LOS 
are empty] is the union of two disjoint events: B = [Slots I,..., m-l along the 
LOS are empty) and C = {Slots I,---, m — 2along the LOS are empty “and” slot 
m-l is occupied}. C is the event of interest and P{C) = P{A) - P(B) = 
7t{L,m-V)-7t{L,m) . Clearly, this event is possible only from the second ring on. 
Thus, a person in ring m is prone to primary hit with probability n{L, m) and to 
secondary hit with probability n{L, m-l) - n{L, m) . The probability that this per- 
son is hit by a fragment is n{L,m)Pfj(m) + q{n{L,m-X)-n{L,m)P^(m-\) . It 
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follows that the expected number Ei of primary and secondary casualties in ring 
m is 



{EL,m m = \ 

[ \_7t{L, m)Pfj (m) + q(K{L, m-\)- k{L, m)))Pfj (m - 1)] otherwise 

(10.14) 



10.2.4 Analysis 

Figures 7 and 8 plot the values of as functions of the crowd size L for the 

cases of perfect blocking {q = 0) and partial blocking {q = 0.5) . We examine two 
arena sizes, M = 10, 20, and two values of the spray beam angle, j3 = 10° and 60° . 
We assume that N = 100 fragments, and on average the height of a person is 3.5 
times his width (c = 3.5); see Sheet “SB” in Chapterl0.xls. 




Figure 7: Expected number of casualties. M = 10, P = 10° (A), 60° (B) 





A B 

Figure 8: Expected number of casualties. M = 20, P = 10° (A), 60° (B). 



Figures 7 and 8 demonstrate that the effectiveness of the suicide bomb does not 
necessarily increase with the size of the crowd in the arena. Beyond a certain 
threshold, the expected number of casualties gets smaller. This phenomenon is at- 
tributed to crowd blocking, which becomes more significant as the density of the 
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crowd increases. Note that in the case of perfect blocking {q = 0, primary casual- 
ties only) the expected number of casualties decreases to 6. This result is true in 
general since the effect of the explosion is limited to the first ring (with = 6) 
when the arena is fully crowded. The effectiveness of the explosion also depends 
on the size of the spray beam angle. When this beam is narrow, the effect is 
stronger than when it is wider. Note also that the addition of secondary casualties 
is significant only when the arena becomes crowded. In a low-density arena only 
direct hits of the fragments affect the number of casualties. 

Figure 9 depicts the effect of the size of the arena on the expected number of 
casualties. We assume a crowd of L = 100 and we vary the size of the arena be- 
tween M = 6 (the minimum size arena that can contain 100 people) and M = 50. 
We assume perfect blocking (q = 0). 




Figure 9: Expected number of casualties as a function of the arena size, 



From Figure 9 we see that the size of the arena affects the damage in a non- 
monotone way. For a certain crowd (L = 100 in our example), there is a capacity 
(M = 25) for which the damage is maximal. 

While the results shown in Figures 7-9 seem to be consistent with data regard- 
ing real SB events, a rigorous statistical analysis to confirm the numerical results 
of the model is difficult, if not impossible. First, there is no reliable record on the 
size of the crowd L in the arena at the time of the real event. This number can be 
estimated, at best, based on interviews of eyewitnesses. Second, while fragments 
are the main cause for fatalities in an SB event, some victims may have been 
killed by blast effects. Also, although some of the recorded injuries are related to 
mental shock and secondary injuries, such as cuts and bruises from debris, many 
of them may have been caused directly by the fragments. Third, the number of ef- 
fective fragments in a suicide belt can also be only estimated. Fourth, the position 
of the SB in the arena affects the results, too. Incidents with relatively few casual- 
ties were typically consequences of a partial successful interdiction of the SB, 
where a guard identified the SB at the door, and as a result, the latter blew himself 
up outside the main part of the arena. 

One possible takeaway from this model is that a natural thing to do in case 
of an imminent threat of an SB - run away - may not be the best strategy 
from the social welfare perspective. In some realistic situations such a response 
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may actually increase the expected number of casualties. The instinct of the sol- 
dier who throws himself on a live hand grenade in order to protect his comrades is 
correct. Another takeaway is that such a model and analysis can help us recognize 
situations most attractive to an SB, and thus better take protective measures. 



10.3 Response Policies for Bioterrorism - The Case of Smallpox 

Responding to bioterror events that involve contagious agents is of major concern 
for authorities in their war against terror. There are many operational and logistic 
decisions that must be made in order to effectively cope with such a threat. These 
decisions are roughly divided into two levels: structural (strategic) decisions that 
are made in advance, before a possible bio-attack, and operational (real-time) de- 
cisions that must be made during such an event. 

Some of the structural problems are as follows: 

• decide how many vaccines to produce and stock, 

• select the supply management policies for allocating, deploying, and 
controlling inventories of vaccine doses and other related supplies, 

• set up the infrastructure (vaccination stations, quarantining facilities, 
transportation capacity, etc.), 

• choose the vaccination procedure (e.g., inoculation only, pre- vaccination 
screening for contraindication), 

• determine the manpower requirements and personnel assignment. 

The operational (real-time) issues include 

• detecting and identifying the type of the bioterror event, 

• managing the contact tracing process (if applied), 

• prioritizing efforts with respect to monitoring, isolating, quarantining, 
tracing, and vaccinating, 

• coordinating the supply chain of vaccines and other supplies, 

• identifying bottlenecks and potential congestion in the response proc- 
ess, 

• determining service capacities and setting service rates. 

One of the most critical decisions - a decision that has both structural and op- 
erational implications - is which vaccination policy to adopt. The vaccination pol- 
icy decision has two levels. At the first level, policymakers must choose between 
essentially two options: a preemptive approach in which the entire population 
is pre-vaccinated, and a “wait-and-see” approach where post-attack emergency 
response (vaccination, curfew, isolation) commences following an outbreak of 
the disease. Mixtures of these two options are possible, i.e., pre-vaccination of 
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first responders (e.g., health-care and law-enforcement personnel) only. Sociologi- 
cal and psychological considerations (is there a real threat or just a perceived 
one?) coupled with medical considerations (fear of vaccination side-effects) often 
hinder policy makers from taking any significant preemptive action. 

If no significant preemptive measures are taken, the question at the second 
level is which post-event vaccination policy to adopt. The two main policies are 
mass vaccination and trace vaccination. In mass vaccination, maximum vaccina- 
tion capacity is utilized to uniformly inoculate the entire population. In trace (also 
called ring or targeted) vaccination, only limited vaccination capacity is utilized to 
selectively inoculate contacts (or suspected contacts) of infected symptomatic in- 
dividuals. In this section we develop a descriptive model for analyzing the effec- 
tiveness of mass vaccination for a highly contagious smallpox epidemic. 



10.3.1 The Epidemic and Possible Interventions 

Suppose that a terrorist releases smallpox viruses in a public area. The authorities 
are not aware of the event until a certain number of symptomatic patients are re- 
ported and diagnosed. Once the epidemic is detected and identified, a response is 
initiated, which may involve isolation, quarantine, tracing contacts of infected 
(and diagnosed) persons, and vaccinating some or all of the population. Smallpox 
has an incubation period during which an infected individual is asymptomatic and 
not contagious. The incubation period is divided into two periods of time: the im- 
munable (also called vaccine-sensitive) period and the non-immunable period. We 
assume that vaccination is perfectly effective during the immunable period, and 
also for uninfected persons. During the non-immunable period, vaccination is not 
effective, and therefore the infected person will eventually become ill. Once the 
incubation period is over, the infected person becomes symptomatic and conta- 
gious, as long as the symptoms persist. Recovered patients of the disease are im- 
mune and not contagious. 

Figure 10 presents the stages and progression of the disease. At any given time 
f, the population of non-vaccinated individuals is divided into six possible stages: 
S - susceptible to the disease; A - infected, not contagious, and immunable; B - 
infected, not contagious, and not immunable; I - infected, contagious, and not iso- 
lated; Q - infected, contagious, and isolated; R - removed (recovered, vaccinated, 
or dead). Figure 10 presents the transitions among the stages. Once the epidemic is 
identified, detected (symptomatic) infected individuals (in stage 1) are isolated 
(moved to stage Q) and the vaccination process commences. Only asymptomatic 
persons, in stages S, A, and B, are vaccinated. While the vaccination is effective 
for individuals in stages S and A, it is ineffective for stage B, as shown in Figure 
10. A detected symptomatic person (in stage I) is immediately isolated. 
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Figure 10: Stages and progression of the epidemic. 



10.3.2 A Model for Mass Vaccination 

The model for mass vaccination is based on the general SIR model for epidemics 
(Anderson and May, 1991) where only the stages S, I, and R are considered. From 
now on we use the same labels to indicate the name of the stage and the set of per- 
sons in that stage. The SIR model is a set of ordinary differential equations (ODE) 
similar to the deterministic Lanchester models of Chapter 5. However, in this 
chapter we present the models in their difference equations form. This form is 
more attune with the typical time resolution of a smallpox mass vaccination proc- 
ess - days - and is more manageable computationally 

5(t-H) = 5(0(1 -T/(0), 

i{t+\) = im+rs{t))-p), (10.15) 

R{t + \) = R{t) + pI{t), 

where the parameter t represents days. 

The number of newly infected depends on the interactions between suscepti- 
ble S (from now on we drop the argument f, except for emphasis) and infected 
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1 persons, which is measured by the product of these two variables. The parame- 
ter 7 represents the intensity of the interactions. The probability that an infected 
person is removed in 1 day is , so the average number removed is pi . Thus, the 
rate of decrease in the number of susceptible persons is ySI, and the net change in 
the number of infected persons is ySI - pi . We assume that the total population is 
fixed; S + 1 + R = P . We are employing expected value analysis (EVA) here, as 
described in Chapters 1 and 5, our justification being that the populations involved 
are mostly large. Results may be inaccurate when populations are small; in fact, 
(10.15) can even report negative values for state variables in some cases. 

In the case of smallpox we also need to take into consideration the incubation 
and isolation stages. Thus, here we assume that S + A + B + I + Q + R = P. Let co 
denote the daily vaccination capacity and assume perfect efficacy of the vaccina- 
tion. Recall that only persons in stages S, A, and B are vaccinated. People in stage 
I who show up in a vaccination center are assumed to be processed (e.g., registra- 
tion, screening) like the others, diagnosed and then isolated. Despite not being 
vaccinated, these persons consume vaccination capacity. Since the vaccination ca- 
pacity is fixed atty but the values of S, A, B, and I change over time, and since we 
assume that the vaccination queue is homogeneous in the sense that it reflects the 
distribution of stages in the population not yet isolated or removed, the probability 
that on a given day t a certain person in the population passes through a vaccina- 
tion center is V(t) = Min{l,a>l(S(t) + A{t) + B{t) + 1(1))). Thus, the number of per- 
sons in stage I that are isolated in the vaccination center is IV. A fraction A. of 
stage I persons that are not scheduled to appear in a vaccination center on a certain 
day show up voluntarily at a medical facility other than a vaccination center (fam- 
ily physician office, emergency room) are diagnosed and isolated. This is the off- 
line isolation process, where /l/(l - E) infected individuals are being diagnosed 
and isolated before arriving at a vaccination center. Isolated persons are removed 
(die or recover) at a rate p. Let a and {3 denote the daily transition rates 
from stage A to B, and from B to I, respectively. The set of difference equations 
that describes the evolution of the smallpox mass vaccination process is 



5(0-t5(0/(0- S(t)V(t) , 


(10.16) 


Susceptibles Susceptibles that are 

that are infected vaccinated 




m)m 


- aA(t) - A(t)V(t) , 


(10.17) 


Newly 

infected 


Persons moving Stage /I persons 

from Stage ^ to S that are vaccinated 




: aA{t) - 




(10.18) 


New persons 
in Stage B 


Persons that 
move to Stage 7 




m\ - 


I(t)V(t) - Al(l-V(t)) , 


(10.19) 


New persons 
in Stage I 


Contagious persons isolated Contagious persons isolated before 

via the vaccination process reaching the vaccination center 
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Q{t + \)= I{t)V{t) + n{t){\-V{t))- pQ{t^ , (10.20) 

Newly isolated persons Newly isolated persons not P^sons removed 

from vaccination centers from vaccination centers isolation 

7?(f + l)= pm + {S{t) + A{t))V{t) . (10.21) 

Persons removed Effectively vaccinated persons 
from isolation 



The subpopulation S (10.16) decreases over time due to contraction of the dis- 
ease and vaccination. The number of persons in the first stage of incubation (A, 
see (10.17)) increases due to infection and decreases by transiting to stage B or to 
stage R, due to vaccination. Equation (10.18) represents the dynamics at stage B\ 
the set of people at this stage is fed by people from stage A and releases infected 
people, who become symptomatic, to stage I. The changes in the set of infected, 
symptomatic and contagious people I are shown in (10.19). New symptomatic and 
contagious people are generated from stage B, and existing such people are re- 
moved, either at the vaccination center or by reporting to medical facilities. The 
set of isolated people (10.20) increases at the rate of isolation (see (10.19)) and 
decreases by people who either recover or die. Finally, the “absorbing” stage R is 
fed by a flow of people removed from isolation and effectively vaccinated per- 
sons. 

For more detailed modeling of smallpox vaccination processes, see Kaplan et al. 
(2002), Kress (2005b), and Kress (2006a). 



10.3.3 Numerical Example 

Consider a population of 10 million people. The number of initially infected by 
the bioterror attack is A(0) = 1000 people. The epidemic parameters (see Kress, 
2005b, 2006a) are shown in Table 1. The difference Equations (10.16)-(10.21) are 
solved in sheet “MassVacc” in Chapterl0.xls. Figures 11 and 12 show the evolu- 
tion of the epidemic over time for various vaccination rates. Figure 1 1 presents the 
number of infected and contagious people (stage I) and Figure 12 presents the 
number of infected people in isolation (stage Q). 

First we make the (optimistic) assumption that the authorities become aware of 
the bio-attack immediately, that is, the mass vaccination process is initiated im- 
mediately after the attack. In reality the bioterror attack would most likely be cov- 
ert and therefore it will take a while until the first symptomatic patients (stage I) 
are detected (this situation is considered later). 
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Value 


Definition 


Symbol 


10“’ 


Infection rate 


r 


0.33 


Transition rate from stage Ato B 


a 


0.087 


Transition rate from stage Bto I 


P 


0.33 


Transition rate from stage I to Q (off vacci- 
nation process) 


X 


0.083 


Transition rate from stage Qto R 


P 



Table 1: The epidemic parameters (all rates are per day). 

The newly infected stage I casualties are /iB (see (10.19)). Figure 11 presents 
the newly infected and symptomatic people (Stage I), called henceforth - casual- 
ties, for three daily vaccination capacities: (O = 500K, 300K, and lOOK. 




Days 



Figure 11: Newly infected casualties for three vaccination capacities. 

The total numbers of casualties are 1508, 2823, and 33,576 for daily vaccination 
capacities 500K, 300K, and lOOK, respectively. Clearly the effect is nonlinear. For 
example, if the daily vaccination capacity is reduced from 500K to lOOK, the 
number of casualties increases by a factor of over 22. The peak of the epidemic 
occurs on day 12 (60 new casualties) when the daily vaccination capacity is 500K, 
and on days 21 (85 new casualties) and 65 (563 casualties) for daily vaccination 
capacities 300K and lOOK, respectively. Also, the epidemic is eradicated after 63, 
78, and 156 days for the three vaccination capacities, respectively. 

An important insight from this model and analysis is logistical and relates to 
the question how big should be the isolation capacity. Figure 12, which looks very 
much like Figure 11, presents the number of casualties in isolation. The required 
isolation capacity is determined by the peak of these graphs. If the daily vaccina- 
tion capacity is 500K then the isolation capacity needed is 589 beds. If it is 300K 
people per day, then the isolation capacity needed is 913 beds. However for daily 
vaccination capacity of lOOK, the isolation capacity needed is 6266 beds. 
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Days 

Figure 12: Casualties in isolation (Stage Q) for three vaccination capacities. 

In reality the vaccination process will commence a few days after the attack, 
when the first symptomatic people show up in medical facilities and the authori- 
ties set up the vaccination process. Let d denote the time that elapses from the day 
of the attack to the day when the vaccination process commences. Figure 13 pre- 
sents the number of newly infected for three delay scenarios: no delay, 4 days, and 
8 days. Assume that the daily vaccination capacity is 500K. The total numbers of 
casualties are 1508 (see Figure 11), 2281, and 3289 for r/ = 0, 4, and 8 days, 
respectively. 




Days 

Figure 13: Newly infected casualties for three detection delay scenarios. 

Notice that the delays in the epidemic detection have little effect on the dura- 
tion of the epidemic. Figure 14 presents the time-dependent number of casualties 
in isolation. 
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Days 

Figure 14: Casualties in isolation (Stage Q) for three detection delays. 

The isolation capacity needed when r/ = 8 is more than twice the capacity 
needed when there is no delay. 



10.4 Counterinsurgency 



Insurgents take advantage of their small signature and elusiveness to launch at- 
tacks against the regime. Their effectiveness and ability to survive and operate de- 
pend on their ties with the general population. The insurgents get new recruits 
from the population and they rely on popular support for logistics and safety. As 
mentioned in the introduction, the key advantage of the insurgents is their elusive- 
ness and invisibility. The regime forces, while having a significant advantage in 
terms of firepower and technology, simply cannot find the insurgents targets. As a 
result, not only may these targets evade the regime’s attacks and continue their in- 
surgency actions but also collateral damage caused to the general population from 
poor targeting by the regime’s forces may generate an adverse response against 
the regime, and popular support for the insurgents. This popular support translates 
into new cadres of recruits to the insurgency. On the other hand, because of the 
large signature and exposure of the regime forces, the insurgents have perfect situ- 
ational awareness and therefore their attacks against the regime forces are focused 
and effective. The insurgents also execute coercive actions against the population 
(e.g., suicide attacks, car bombs) with the objective to force the population to align 
with the insurgency. However, these actions may have mixed effects. The attitude 
of the population, which has a significant effect on the fate of the insurgency, is 
greatly affected by the sense of security (Lynn 2005, Hammes 2006). The popula- 
tion will align with the side that is perceived as better protecting it, or at least less 
threatening. Therefore, the coercive actions against the population by the insur- 
gents may be a double-edged sword. On the one hand it may lead the population to 
believe that aligning with the insurgency will buy them protection against the in- 
surgents’ violent actions, but on the other hand it may instigate resistance to the 
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insurgency and generate support for the regime who fights the insurgents. In this 
model we assume no such coercive actions by the insurgents. The main goal of the 
regime is eliminating the insurgency. 

In the next section we present a dynamic model, which is a variant of a deter- 
ministic Lanchester system (see Section 5.2), for investigating cause-and-effect 
relations among the various parameters that affect the outcome of insurgency- 
counterinsurgency operations: force sizes, attrition rates, recraitment rates, popular 
support, and intelligence. 



10.4.1 The Model 

The model follows the dynamics of three groups: the insurgents I, the regime sup- 
porters in the population S, and the regime contrarians (insurgency supporters) in 
the population C. The fourth group - regime forces G - is assumed to remain fixed 
throughout the insurgency because the attrition the insurgents cause the regime 
forces does not affect significantly its size and capabilities. Thus, absent rein- 
forcement, the regime force G, remains constant throughout the insurgency. A 
similar assumption is applied to the population; the attrition to the insurgency and 
the general population (as a result of collateral casualties) do not affect signifi- 
cantly the size of the population, which is typically several orders of magnitude 
larger than the attrition to P. Therefore we also assume that the total population P 
= S + C + 1 remains constant. 

The insurgents are diffused in the population and therefore, absent intelligence, 
the measure of their signature as targets for the regime force, is IjP , which may 

be interpreted as the probability that a randomly selected target is an insurgent. If 
the attrition coefficient of the regime forces is y , then the attrition caused by the 

regime is yG (see Lanchester square law in Chapter 5), out of which a fraction 
7/P affects the insurgency and a fraction {S + C)lP affects the general popula- 
tion. Thus, the attrition of the insurgency depends on both G and I (see the guer- 
rilla warfare model in Chapter 5, and Deitchman (1962)). The insurgency recruits 
from the population of contrarians at a rate pC. The dynamics of the insurgency is 

represented by the following ordinary differential equation (in the following X de- 
notes the derivative of X with respect to time): 

i = -yGllP + pC. (10.22) 

The above equation assumes that the regime has no intelligence; it shoots in the 
dark and therefore it hits an insurgent with probability //P . In reality, the regime 

force would invest effort in gathering intelligence, which leads to enhanced 
situational awareness. Denote the intelligence level by p, 0 < p <l , where this 
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parameter may be interpreted as the probability that the regime has good intelli- 
gence, which facilitates effective targeting. Thus, with probability jj. the regime 

effort is focused on the insurgency targets and leads to a Lanchester square law 
(see Chapter 5) while with probability jJ. this effort is diffused and leads to the 

Deitchman guerrilla model (Deitchman 1962) mentioned above. Consequently, in 
the presence of partial intelligence, (10.22) becomes 

i = -YG{pi + {l-lil)HP) + pC , (10.23) 

and the rate of collateral casualties in the population is yG{l - jU) (1 -I fP ) . 

There are several factors that affect the attitude of the population toward the 
regime. These include the economic situation, civil liberties, and political leader- 
ship. But, as mentioned above, the most significant factor is the sense of security. 
This model focuses on the security factor, and a simple way to represent this factor 
is by the weighted difference between the attrition the regime causes the insur- 
gency, and the collateral casualties in the population, generated by the regime 
actions against the insurgency. This weighted attrition balance denoted by jd is 

p = yGip + {\-p) IIP) - vyGil - /t) (1 - I/P) = 

yG(M + (l-M)iil + v)I/P-v)). ^ ^ 

The weight parameter v represents the population’s cognitive trade-off between 
insurgency casualties and collateral innocent civilian casualties. The larger the 
value of V, the more sensitive the population to its own casualties. If P >0 then 

the attitude flow balance is directed from contrarians to supporters, and if P <0 , 
then the opposite is true. Notice that P >0 if and only 
if jU/ (I- jU) >v-{l + v)I/P . Assuming linear dependence on the attrition balance, 
and noting that S = P-C - 1 , we obtain that the supporters-contrarians dynamics 
in the population is captured by 

C = -Min{P, 0}r/pP-C-I)- (Max{ P, 0} 7]^ + p)C, (10.25) 

where and rj^ are the transition coefficients. The first term in (10.25) is the 
flow from ^ to C when P <() . The second term represents outflow from C to 5 and 
I. The set of equations (10.23) and (10.25) describes the evolution of the insur- 
gency. The variable S has been eliminated through the assumption that the total 
population P remains fixed. 
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10.4.2 Numerical Example 

Consider an insurgency that starts off with /„ = 2000 active insurgents. The re- 
gime force is G = 100,000. The population is P = 10^ out of which initially 20% 
are contrarians and 80% are supporters of the regime. The values of the other pa- 
rameters are summarized in Table 2. The time resolution of the model is days and 
the differential equations (10.23), (10.25) are solved as difference equations (see 
sheet “Counterinsurgency” in Chapterl0.xls). 



Value 


Definition 


Symbol 


1.5x10^" 


Regime attrition 


Y 


0.5x10 


Insurgency recruitment 


P 


2 


Population attrition trade-off 


V 


0.7 


Intelligence level 


p 


0.001 


5 — > C transition 


Ps 


0.001 


C — ^ 5 transition 


Pc 



Table 2: The insurgency parameters. 

The units of y, p, , and are inverse time, while V and p are dimen- 
sionless. 

Note that at the beginning of the insurgency 2.33 = p/(l-p) > v-{\+v)I j P = 
2-3x2000/10’ =1.9994 , that is !3>0. Therefore, right at the beginning of the 
insurgency, more contrarians become supporters than the opposite and the opposi- 
tion to the regime in the population fades away. 

Figures 15 and 16 depict the evolution of the insurgency, over a period of 
time of 7 years, for three intelligence levels: 0.7 (base case - see Table 2), 0.5 
(worse intelligence), and 0.9 (better intelligence). Figure 15 presents the insur- 
gency and Figure 16 presents the number of contrarians in the population. 



Insurgents 




Figure 15: Size of the insurgency for three levels of intelligence. 
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We see that the fate of the insurgency is very sensitive to the level of intelli- 
gence. When the intelligence is poor {fi = 0.5) then yff < 0 and the insurgency 
grows monotonically until the insurgents take over the regime. When ^ = 0.7, the 
insurgency grows for almost 4 years, reaches a size of over 40,000 insurgents, and 
then starts to decline. With very good intelligence (,u = 0.9)the insurgency 
reaches a maximum of just over 7500 insurgents after 190 days but is completely 
eradicated after less than 2.5 years. 



10000000 

8000000 

6000000 

Contrarians 

4000000 

2000000 

0 



Time 



)i = 0.7 

1T = 0.5 
)i = Q.9 




Figure 16: Number of contrarians for three levels of intelligence. 

Similar behavior is observed with regard to the number of contrarians C in the 
population. When /i <Q, C grows until there are no supporters left in the popula- 
tion. The slight decline in C after reaching its peak is due to ongoing recruitment 
to the insurgency. H> 0.7 , then P >Q, and the resistance to the regime in the 
population declines. However, when ji = 0.7, the decline in the number of con- 
trarians is not fast enough, therefore the insurgency can grow to over 40,000 (see 
Figure 15). Arguably, the recruitment coefficient p, which is fixed in our model, 
may be variable too; it may decrease when > 0. 

So far we assumed that the intelligence level p is fixed. In reality it may change 
over time and improve as the regime succeeds in infiltrating the insurgency ranks 
and gathers more intelligence. Suppose that the intelligence level is a sigmoid 
function of time; it grows initially slowly, then grows rapidly, until it ebbs off 
when the regime reaches its maximum intelligence-gathering capabilities. 
Let, /t(t) = a! {\ + be ~'“) , where p{bi) = apl + b) is the initial intelligence level, a is 
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the maximum possible intelligence level, and k is a. scale parameter that deter- 
mines the evolution of ju(t) between the two extremes, see Figure 17. When jU is 
variable, the sign of fi may change over time; while initially the insurgency grows 
for small values of//, it starts to decline later on when// gets larger. Figures 18 

and 19 present the evolution of the insurgency and the number of contrarians, 
respectively, when // behaves as in Figure 17. 




Figure 17: Intelligence level //(/) : a = 0.9, b = 6, k = 0.02. 



80000 

70000 

60000 

50000 

Insurgents 40000 
30000 
20000 
10000 
0 



Time 




Figure 18: Size of insurgency for variable intelligence level. 
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Figure 19: Number of contrarians for variable intelligence level. 

The initial sharp increase in contrarians, followed by increase in insurgency, is due 
to initial poor intelligence (/t(0) = 0.13). This trend is reversed when the intelli- 
gence is improved over time. The decline in C is much sharper than the decline in 
I because, according to our model, the insurgency decreases only because of attri- 
tion by the regime forces; it may increase due to popular support but it does not 
decrease solely because of lack thereof. 
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Exercises 

(1) Consider the SB model in Seetion 10.2. There are Z. = 50 in an arena of size 
M = 12. Beeause of the nature of the event, the erowd is eoneentrated in one 
ring. Compute the expeeted number of easualties if 

(a) the erowd is eoneentrated in ring m = 9 

(b) the erowd is eoneentrated in ring m = l2 (the walls of the arena) 

(e) the erowd is uniformly distributed in the arena, as in the model, and 
q = l (eertain seeondary injury) 

Use sheet “SB” in Chapterl0.xls. 

Ans. (a) 25; (b) 16; (c) 17.8 

(2) Consider onee again the SB model. Suppose that instead of loeating himself 
at the eenter of the arena the SB sets off the bomb at the boundary. Diseuss 
qualitatively the effeet of the attaek in this ease. Suppose that the arena eom- 
prises one ring and a eenter point (Af = 1). The erowd is four persons ran- 
domly distributed in the arena (L = 4), absent bloeking a person is hit with 
eertainty (Z^ = 1), and there are no seeondary easualties (q = 0). For the data 

in Seetion 10.3.3, eompare the expeeted number of easualties in the two see- 
narios: (a) the SB is at the eenter of the arena (see eenter part of Figure 5) 
and (b) the SB is in one of the six slots of the (outer) ring. Assume that a 
person is exposed to the SB only if the line conneeting the eenters of the two 
eorresponding slots does not toueh another oceupied slot. 

Ans. (a)4, (b) 3x — -i-2x — -i-lx— = 2.4 
15 15 15 

(3) Consider the bioterror model in Seetion 10.3. Obviously, timely deteetion of 
the attaek and effeetive isolation proeess ean reduee the number of easual- 
ties. Using sheet “MassVaee” in Chapterl0.xls, plot a graph that presents 
the total number of easualties as a funetion of (a) the deteetion delay d and 
(b) the off-line isolation rate A. . Use the data in Seetion 10.3.3. 

(4) Consider the numerieal example in Seetion 10.4.2, and suppose that the in- 
telligenee level p is eonstant as a funetion of time. Using sheet “Counterin- 

surgeney” in Chapter 10. xl?, answer the following questions: (a) what should 
be the minimum intelligenee level sueh that the insurgeney is eradieated af- 
ter at most 2.5 years? (b) If ^ = 0.8, what is the minimum regime foree size 
that will eradieate the insurgeney within 2 years? 

Ans. (a) 0.885; (b) 150,000. 




Appendix A 

Probability - the Mathematics of Uncertainty 



The single most useful field of mathematies in the eonstruetion of eombat models 
is mathematieal probability theory. This appendix is an elementary introduetion to 
the topie. 

A.1 Preliminaries 

Central to any diseussion of probability is the idea of an observed experiment. We 
might flip a eoin and observe whether it lands heads or tails, or shoot at a target 
and observe the miss distanee, or piek up the New York Times and observe 
whether the word “Pirates” oeeurs in the headline. Events are things that either 
happen or do not when an experiment is performed. Symbols for events are eus- 
tomarily upperease letters at the beginning of the alphabet, espeeially E. The prob- 
ability of an event P{E) is a measure of its likelihood when the experiment is per- 
formed. P{E) is measured on a eontinuous seale [0,1] where 0 denotes 
impossibility and 1 denotes eertainty. 

It is important to be elear about what the experiment is. Let E be the event 
that eertain developmental radar will deteet a speeified submarine periseope at a 
range of 5 miles in sea state 3. P{E) ean be just about anything, depending on de- 
tails sueh as how often the periseope emerges, how long it stays up, and the alti- 
tude of the radar. Unless there is a elear understanding of the experiment, state- 
ments about probability are pretty mueh meaningless. 

We often imagine that the experiment is repeatable, so that it ean be per- 
formed over and over. The outeome of the experiment may vary with the repliea- 
tion, but the experiment still has the same ehanees of produeing its various out- 
eomes on every trial. In that ease we ean approximate the probability of a simple 
event, say the probability of a eoin turning up heads, by eounting the number of 
heads that oeeur in n tosses and dividing by the total number of tosses. As the 
number of tosses beeomes larger and larger, this ratio approaehes eloser and eloser 
to the preeise value of the desired probability. In a similar manner, it would be 
possible to ealeulate the probabilities assoeiated with the faees of a die, the prob- 
ability of a newborn baby being a boy, and so on. Symbolieally, the true probabil- 
ity of any sueh event is 

number of times the event happens in n trials of the experiment _ 

P{E)=\\m„^^ . (A.l) 

n 

Probability theory eoneems the ealeulation of the probabilities of events in 
well-defined experiments. Probabilities are simply numbers, but events and ex- 
periments have yet to be given a mathematieal form. It turns out that the most use- 
ful form is to imagine that an experiment always produees an outeome that be- 
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longs to a set called the sample space, and that any event is simply a subset of the 
sample space. 

Example 1: Two navy battle groups, each comprising four ships, are under air at- 
tack. 

Experiment. Counting the number of damaged ships after the attack. 

Outcomes: (4,4), (4,3), (4,2), (4,1), (4,0), (3,4),. . .,(0,0). 

Sample Space: Comprises 25 outcomes {{ij) i j = 0,1,2, 3, 4} ({..} - symbol of 
a set). 

Event: E = {Total number of damaged ships is no more than 2} = {(0,0), 

(0,1), (1,0), (1,1), (2,0), (0,2)}. 

Example 2: Engage a target with five rounds of fire. 

Experiment: Observe the result of a shot. Stop after the first hit or when am- 
munition is used up. 

Outcome: A sequence of misses (M) that may or may not end with a hit (H). 
Sample Space: {H, MH, MMH, MMMH, MMMMH, MMMMM}. 

Event: E = {The target has not been hit} = {MMMMM}. 

We have seen that an event is associated with a set of outcomes. Sets can in- 
tersect or unite. The intersection of two sets A and B, denoted as 4 fl is the set 
of all the outcomes that are both in A and B. Consider Example 1 . Let A be the 
event “Battle Group 2 lost one ship = {(0,1), (1,1), (2,1), (3,1), (4,1)}, and let B be 
the event “Battle Group 1 lost two ships” = {(2,0), (2,1), (2,2), (2,3), (2,4)}. The 
intersection is the event “Battle Group 2 lost one ship and Battle Group 1 lost two 
ships” = {(2,1)}, the only outcome in both A and B. The union of two sets A and 
B, denoted as 4 U 5, is the set of all the outcomes that are either in 4 or in B. The 
union of the events 4 and B defined above is the event “Battle Group 2 lost one 
ship or Battle Group 1 lost two ships” = {(0,1), (1,1), (2,1), (3,1), (4,1) {(2,0), 

(2.2) , (2,3), (2,4)}. The complement of a set 4, denoted as 4 (or 4^ ), is the set of 
all the outcomes in the sample space that are not in 4. For example, the comple- 
ment of the event C = “Battle Group 1 lost at least two ships” = {(2,0), (2,1), (2,2), 

(2.3) , (2,4), (3,0), (3,1), (3,2), (3,3), (3,4), (4,0), (4,1), (4,2), (4,3), (4,4)} is C = 
“Battle Group 1 lost at most one ship” = {(0,0), (0,1), (0,2), (0,3), (0,4), (1,0), 
(1,1), (1,2), (1,3), (1,4)}. The union of C and C is the entire sample space, de- 
noted by S. 

A.2 Computing Probability 

The probability of an event depends on the composition of outcomes in its corre- 
sponding set. If all outcomes in the sample space are equally likely to occur, then 
the probability is simply the number of outcomes in that set divided by the number 
of outcomes in the sample space. 
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Example 3: A fair die is tossed once. The sample space is 5 = {1, 2, 3, 4, 5, 6} 
and because the die is fair all six outcomes are equally likely. The probability of 
the event E = “the outcome is an even number” is 



P(E) = 



Number of outcomes in {2, 4, 6} 3 1 



Number of outcomes in S 



6 2 



(A.2) 



Using the concepts and definitions discussed thus far, the following basic re- 
lationships can be derived for any two events A and B\ 

• The probability of an event is between 0 and 1 : 0 < P(A) < 1 . 

• P(A) = 1 - P(A) . The sum of probabilities of an event and its com- 
plement is 1 . 

• P(A f]A) = P(^) = 0 ((^ is the empty set). It is impossible that an 
event and its complement will occur together. 

• P{A \JA) = P(S) = 1 . Either an event or its complement must occur. 

• P(A U 5) = P(A) + P(B) -P(Ar\B). Implication: if A and B are dis- 
joint (no outcomes in common) thenP(^U^) = P(A) + P(B) . 



Example 4: A fair die is tossed once. A = “the outcome is an odd number” = {1, 
3, 5},B = “the outcome is smaller than 4” = {1, 2, 3}. P(the outcome is odd num- 
ber that is smaller than 4) = P(A fl ^) ^ Number of outcomes in the set 

{1, 3}/Number of outcomes in 5 = 2/6 = 1/3. A = “the outcome is not an odd 
number”= {2, 4, 6}. P(A[jA) = /’(“outcome is either an odd number or an even 
number”) =P(5)= 1. 

A.3 Conditional Probability and Independence 

The probability that two events, A and B, occur at the same time is the probability 
of the intersection of the two corresponding subsets of the sample space, as shown 
above. In this section we will compute this probability when the events are inde- 
pendent. First, however, the notion of conditional probability will be discussed. 

The conditional probability of one event, B, occurring given that another 
event. A, has occurred is denoted hy P{B\A) . The meaning of the words condition 

and given is that essentially the sample space S has “shrunk” to the set A. The term 
“Given A occurred” means that only outcomes that belong to A matter now; the 
other outcomes in S may be ignored because we know that they cannot occur sim- 
ply because A has occurred. Thus, the probability P{B\A) is the number of out- 
comes in B that are also in A, divided by the number of outcomes in A (which cur- 
rently is our shrunk sample space). If n{A{^B),n{B) and n{S) denote the number 
of outcomes that are both in A and B, in B, and in the entire (original) sample 
space, before it has been shrunk into A, then. 




240 



Appendix A 



r(j? Li) - ^ ^ ^ 

^ n(A) n(A)ln(S) P(A) 



(A.3) 



Equation (A.3) is very important; it leads to many other properties and useful 
results. We ean see right away that 



The expression in (A.4) is known as the multiplicative law of probability. 

Example 5: A fair die is tossed onee. What is the eonditional probability that the 
outeome is 2 given that the outeome is even? A = {2,4,6}, B = {2}, A[]B = 2. 

P(B \A) = P{A n B)IP(A) = (1 / 6) /(3 / 6) = 1 / 3 . 

The events A and B are said to be independent if the eonditional probability of 
event B given event A equals the uneonditional probability of event B. In other 
words, whether or not A is known to oeeur has no bearing on the probability of B\ 
oeeurrenee. In this ease, P{B\A) = P(B) and the multiplieation law simplifies to 
P(A[]B) = PiA)PiB). 

Example 6: When we say that two rounds of fire are independent of eaeh other 
we mean that the outeome of one shot does not alter the probability of sueeess of 
the other shot. In other words P(Shot 2 hits | Shot 1 hits) = P(shot 2 hits) and 
therefore P(Shot 1 hits and Shot 2 hits) = P(Shot I hits) X P(Shot 2 hits). 

A.4 Bayes Rule 

Suppose that in some experiment the event B ean oeeur only in eonjunetion 
with one of several mutually exelusive events A,,...,A„ . We know the probabili- 
ties P(A.), i = l,...,n and the eonditional probability P(BjA.), i = l,...,n, and that 
B oeeurs given that A. has oeeurred. Then the theorem of total probability allows 
us to ealeulate the uneonditional probability of the event B: 



Bayes Rule (or Bayes theorem) uses (A. 5) and the definition of eonditional 
probability to reverse the eondition between B and A^ . The eonditional probability 
of A. given B is 



P{A[^B) = PiB\A)P{A). 



(A.4) 



P{B) = P{B\A, )P(4 ) + ... + P(B \A„ )P(AJ. 



(A.5) 





(A.6) 



P(B\f)P(A,) + - + P(B\ A^ )P{A^^ ) ' 
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Example 7: There are three boxes, 1, 2, and 3. Each box contains 10 items of a 
certain product. There are 5, 6, and 8 good items in boxes 1, 2, and 3, respectively. 
The remaining items are defective. A box is chosen at random and then an item is 
drawn from it randomly. If the item drawn is defective, what is the probability that 
box A was chosen? To solve let B = {item is defective} and Aj = {box j is se- 
lected}, j = 1, 2, 3. Clearly P{Aj) = 1/3, j = 1, 2, 3. Also, 

P{B\A^) = 5/10, P{B\A^) = 4 / 10 , and P{B\A^) = 2/10. The desired probability, us- 



ing (A. 6) with n = 3, is 



PiA I B) = 



(5/10)(l/3) 

(5/10)(l/3) -r (4/10)(l/3) -r (2/10)(l/3) 



IT' 



Example 8: The probability that a certain object in a target area is a valuable tar- 
get is 0.3. An imperfect sensor observes the object and declares it to be a valuable 
target. The probability the sensor fails to recognize a valuable target is 0.2, and the 
probability it erroneously identifies a worthless target as valuable is 0.3. What is 
the probability that the object is a valuable target? Let V= {the object is a valuable 
target} and A' = {the object is not a valuable target}. Let SV denote the event that 
the sensor identifies the object as valuable target and SA the event that the sensor 
identifies the object as not valuable. We wish to compute P{V\SV ) : 



P{V I SV) 



P(SV\V)P(V) 

p(sv I V) P(V) + P(sv I v^) p(vA 



(0-8)(0.3) 

(0.8X0.3) 4- (0.3)(0.7) 



0.53. 



A.5 Random Variables 

A random variable is a transformation which associates with each point or 
collection of points in the sample space some real number. For example, in the 
case of two dice being tossed simultaneously, the outcomes could be represented 
very conveniently by a random variable, X, where X = the sum of the spots on the 
dice (possible values: 2,... ,12). Other possible random variables associated with 
the sample space generated by tossing two dice are the absolute difference be- 
tween the spots on the two dice (possible values: 0,...,5), the greater of the two 
numbers of spots showing (possible values: 1,...,6), the number of 3's showing 
(possible values: 0, 1, 2), or whether the outcome is either 7 or 1 1 (possible values: 1 
if either of those outcomes results, otherwise 0). In each case, the random variable 
specifies a number to be associated with each possible outcome of the experiment. 
A random variable is said to be discrete if it can take on only a finite or countably 
infinite number of values. Otherwise it is said to be continuous and usually takes 
on as values all the real numbers in some interval. 

A random variable is represented by two related functions that describe its 
probabilistic behavior. For a discrete random variable these two functions are 
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called probability mass function (pmf) (or in short, probability function) and cu- 
mulative distribution function (cdf). The pmf Px(x) is simply the probability that 
the value of a random variable X is x: p^ix) = P(X = x). The cdf is defined 
as Fx (x) = P{X < x). If there is no danger of confusion about which random vari- 
able is being described, the subscript X will be omitted in both cases. 

Example 9: A dice is tossed twice. Let X = sum of the spots. Then 
/?(5) = 4/36 = l/9 and/?(12) = l/36. Also 

7^(4) = p{2) + p{3) + /?(4) = 1/36-1- 2/36 -I- 3/36 = 1/6. 

The probability associated with a continuous random variable is characterized 
by a function / called a probability density function (pdf) or simply density func- 
tion. The area under this function represents probability and hence the total area 
under this curve is 1. The area/(x)Ax gives the approximate probability that the 
value of the random variable X will occur in an interval of length Ax around x. The 
cdf for a continuous random variable is 

X 

F{x) = P(X < x) = j f(x)dx. (A.7) 

It follows that lim F{x) = 0 andlimF(x) = 1, and 

X— X— 

X2 X, 

P{Xj<A<X 2 ) = l"‘^"f{x)dx= j f(x)dx- j f(x)dx = F(x 2 )-F{xf. (A.8) 

The pdf of a continuous random variable is obtained by differentiating the cdf 

Example 10: The distance between two adjacent mines is 1000 m. A ship 
can cross the line connecting the two mines anywhere on that line with equal 
probability and therefore any distance between 0 and 500 m from a mine 
is equally likely. Thus, /(x) = 1/500. The probability that the distance of 
the ship to the nearest mine is between 200 and 300 m is given by 

300 I 200 1 1 QQ 

P(200 < X < 300) = f dx- f dx = = .2. 

I 500 I 500 500 

The general term probability distribution is used to denote the particular prob- 
ability functions applicable to a given random variable. In the discrete case, the 
probability distribution may be specified by stating either the pmf or the cdf The 
discrete probability distributions encountered most frequently in this text are the 
binomial distribution, the Poisson distribution, and the geometric distribution, 
each of which is discussed separately later on in this appendix. A continuous 
probability distribution may be specified by identifying either its pdf or its cdf 
Among the most common of these continuous probability distributions are the uni- 
form (see Example 10), normal, and exponential distributions. These and the 
bivariate normal distribution, which is an example of a continuous joint distribu- 
tion of two variables, are also discussed in this appendix. 
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Similar to the definition of independence with respect to events (Section A.3), 
two random variables X and Y are said to be independent if 
P{X < X and Y<y) = P(X < x)P(Y <y) = F^(x)Fy(y). 

A.6 The Mean and Variance of a Random Variable 

As the name implies, the mean, also called expected value, of a random vari- 
able is a measure of "average value" of the random variable. It is somewhat analo- 
gous to the center of gravity used in physics. The symbols commonly used to rep- 
resent the mean of a random variable X are E[X\ and p. In the discrete case, the 
mean is calculated by taking the weighted average of all the possible values of the 
random variable, i.e., by multiplying each value by its respective probability of 
occurrence and then summing all of the resulting products: 

E{X) = p= X (A-9) 

All possible x,- 

Example 11: The number of targets X that are present in a certain target area is x 
with probability p{x) = 0.5"'^' /(I -0.5^), x = 0, 1, 2, 3. The expected value of the 
number of targets is 

o A 2 111 

E(A) = 0x — + lx — -)-2x — + 3x— = — . 

15 15 15 15 15 

The extension of the concept of mean value to the continuous case follows di- 
rectly in a manner completely analogous to the discrete case. The only difference 
is that the sum is replaced by an integral 

E{X) = iu= j xf{x)dx. (A.IO) 

All possible X 



Example 12: Consider Example 10. The expected distance between the ship and 
the nearest mine is 

500 

r X 

E(X) = dx = 250 m, as one would expect. 

J 500 

The expected value is a linear operator, that is, if a and h are any constants, 
then E(aA -I- h) = aE( A) -I- h. Furthermore, for any set of random variables (de- 

n n 

pendent or independent) Aj,...,A„, we have that A,) =^E(A,). 

1=1 1=1 

The variance of a random variable is a measure of its dispersion from the 
mean. Formally, 



V{X) = a^ =E{X-pf. 



(A. 11) 
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A useful formula for computing the variance is obtained by observing that 

E{X - juf = E{X^ -IjuX + ju") = E{X^)-2 ILiE{X) + = E{X^)~ ju\ (A.12) 

To illustrate the usefulness of the variance as a measure of dispersion, it is 
convenient to introduce a unit or measure called the standard deviation, o, which 
is defined as the square root of the variance. A probability distribution having a 
small and hence a small o would be concentrated tightly about the mean. On 
the other hand, a probability distribution having a large o would be dispersed more 
widely about the mean. 

Example 13: Consider Example 10. The variance of the distance between the ship 
and the nearest mine is 

500 2 

V(X)=E(X^)-u" = [—dx-25(f= 250' =20833.33 m. The stan- 

J 500 3-500 

dard deviation of this distance is (7 = -^V{X) = x/20833.33 = 144.34 m. 

It is easily shown that V (aX + b) = a^V (X). Note that adding a constant to a 
random variable does not change its variance at all. If A,,...,X„ are independent 

n n 

random variables, then =^F(X,). This relation does not hold if the 

1=1 1=1 

random variables are not independent. Note that it is variances that add when in- 
dependent random variables are summed, not standard deviations. Adding stan- 
dard deviations is almost always the wrong thing to do. For example, suppose a 
shot misses a target for two independent reasons, one that the gun is aimed at the 
wrong place (error X) and the other that the wind blows the shot around as it 
moves toward the target (error Y). The two errors have means of 0 and standard 
deviations 30 and 40 m, respectively. The total error is X+Y, but the standard de- 
viation of that error is not 70 m, but rather 50 m, the square root of the sum of 
900 and 1600 m'. Variances add, not standard deviations. 

The correlation coefficient of two random variables X and Y is 

p = P(X, T) = ^ EiXY)-EiX)EiY) ^ ^3^ 

where jU^ and jily are the mean values of A and F, respectively, and cr^^and f7j,are 

the standard deviations, respectively. If the two random variables are independent 
then /? = 0. If 0 then the two random variables are said to be correlated. 
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A.7 Discrete Probability Distributions 

In this section we describe certain widely used discrete probability distribu- 
tions: binomial, geometric, and Poisson. 



Binomial Distribution 



The binomial distribution arises from a sequence of simple independent trials 
in which the occurrence or the nonoccurrence of an event is the only item of inter- 
est. If, in such a sequence of trials, the individual probability of the event's occur- 
ring on any one trial remains constant through the sequence, then these trials are 
referred to as Bernoulli trials. A good example of Bernoulli trials is a sequence of 
coin tosses for which the probability of a head on a single toss is p and the prob- 
ability of a tail is q = l — p. Another example is when each of a sequence of tar- 



gets is engaged by exactly one round of fire. The shots are independent and the 
targets are equally vulnerable; the probability a target is killed following a shot is 
a constant p, where the probability it is still alive is q = \- p. In situations such as 



these, it is desirable to know the probability of a specified number of occurrences 
of the event in a given number of trials (e.g., the number of killed targets follow- 
ing n shots). An occurrence of the event is sometimes referred to as a success and 
a nonoccurrence as a failure. The binomial distribution provides the means of de- 
termining the probability of exactly x occurrences of an event in n independent tri- 
als. The probability mass function of the binomial distribution is 



P{X = x) = 






p q 



X = 0,1,..., n. 



(A. 14) 



where n is the number of trials, x is the number of successes, and 

x\{n-x)\ 

the number of combinations of n things taken x at a time. The expected number of 
successes is jU = np and the variance is V (X) = a^ = npq. 



Example 14: A shooter, having five rounds of fire, engages five targets, each with 
one round of fire. The probability that a round of fire kills a target is 0.7. The 

probability that exactly three targets are killed is P(X = 3) = 



The probability that 

^ 5 ^ 



P(X > 4) = 



v4y 



(0.7)"(0.3)‘ -h 



at 

v5y 



(0.7)" (0.3) =0.309. 
least four targets are killed is 
(0.7)" (0.3)“ = 0.528. The expected number of 



killed targets is ^ = 5x(0.7) = 3.5. 



Geometric Distribution 

It has been seen that the binomial distribution applies to situations where the 
random variable is the number of successes in n independent Bernoulli trials with 
a constant probability p of success on each trial. An associated random variable is 
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the number of independent Bernoulli trials until the first success occurs. The prob- 
ability of this number is given by the geometric distribution and its pmf is 



P{X = x) = {\-py-'p, x = l,2„ 



(A. 15) 



The expected number of trials until the first success is — and the variance is 

P 



V(X) = a^ = 



(I-P) 



Example 15: A shooter repeatedly engages a target until it is killed. The probabil- 
ity that a round of fire kills a target is 0.7 and the shots are independent. The prob- 
ability that the shooter will shoot no more than two rounds until his mission is over 
is P{X <2) = /h-(1-/?)/7 = 0.7-i-0.3x 0.7 = 0.91. The expected number of shots 
until the target is killed is 1/0.7 = 1.43 rounds. 

Poisson Distribution 

Another extremely useful probability distribution is the Poisson, which typi- 
cally applies in situations in which an event occurs repeatedly in a “completely 
random” or “haphazard” manner. A random variable X, which is Poisson distrib- 
uted is the number of such occurrences in a certain time interval. The probability 
function for a Poisson random variable with parameter X is defined by 



P{X = X) 




X = 0,1,2,... . 



(A. 16) 



A distinctive property of the Poisson distribution is that its mean equals its vari- 
ance, specifically, p = = X. The Poisson distribution approximates the bino- 

mial distribution when n is large, p is small, and X = np. Another important prop- 
erty of the Poisson distribution is that a sum of independent Poisson random 
variables has also a Poisson distribution. Specifically, if eachX.,/ = has a 

Poisson distribution with parameter A, , and Xj,...,X^ are independent, then 

fl n 

^ X; has also a Poisson distribution with parameter X = ZA- 
/=! /=1 

Example 16: The nutuber of emissions frotu a radioactive material is a Pois- 
son random variable with parameter X = 20t, where t is the time in hours. 
The probability that exactly five emissions occur in 1/2 h is derived by first 
computing the parameter (mean and variance) /I that corresponds to 1/2 h: 
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(20)(l/2) = 10 emissions. The probability of five emissions in 1/2 h is 

10'e^'“ 

P{X = 5) = = 0.038. 

5! 

A.8 Continuous Probability Distributions 

In this seetion we deseribe eertain widely used eontinuous probability distri- 
butions, where the random variable ean obtain any real value in a eertain interval. 
The distributions deseribed in the following are uniform, exponential, and normal, 
ineluding the bivariate normal. 

Uniform Distribution 

Consider an interval \a,b\ in whieh eaeh point is equally likely to be selected. 
Thus, the probability that a selected point in that interval is in the sub-interval 
[a,x\, a<x<h, is proportional to the length of [a,x\. Thus, the cumulative distri- 
bution function is 



F{x) = P(X <x) = 



b-c 



a<x<h. 



(A. 17) 



The density function is obtained by differentiating F(x)\ f(x) = . The 

b — a 

mean of the uniform distribution with parameters (boundaries) a and b is 



F = 



a + b . . 2 {b-af 

. The variance is <T = - 



12 



See Examples 10 and 13. 



Example 17: A target can be present anywhere on the interval [0,1]. The 
probability the target is located between the points 0.4 and 0.7 is 
P(0.4 <X< 0.7) = 0.7 -0.4 = 0.3. The mean point is 0.5. 

Exponentiai Distribution 

The exponential distribution has only one parameter A. This distribution is 
probably the most widely used in operations research. The cdf of the exponential 
distribution is 



F{x) = 1 — x>0 



(A. 18) 



and its pdf is f{x) = Ae . The mean of this distribution is 1//1 and the variance 

is 1/A^. An important property of the exponential distribution is that it is mem- 
oryless; the probability that the next event will occur, say, in the next 10 s is inde- 
pendent of the time elapsed since the last event. An exponential random variable is 
often used to model the (next) time of occurrence of “random” or “haphazard” 
events, e.g., the time it takes a “no-wear” part to fail, for a radioactive isotope to 
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emit a radiation blast or a customer to arrive at a bank. This distribution plays a 
key role in the Poisson process, described later on. 

Example 18: The time between two consecutive messages arriving at the intel- 
ligence cell in a brigade command post follows an exponential distribution with 
mean of 5 min. The probability that during a period of 7 min there is no message 

_7 

is P(X > 7) = 1 - F{1) = = 0.25. 



An interesting property of the exponential distribution is that the minimum of 
n independent exponential random variables has also an exponential distribution 
with a parameter that is the sum of n parameters. Formally, if X. are independent 
exponentially distributed random variables with parameter A,, / = then 

Y = min{Aj,..., is also exponentially distributed with parameter More- 

i=i 

over, the probability that this minimum is attained hyX. is ■ This situa- 

/ Al 

tion is called a race of exponentials and plays an important role in the theory of re- 
liability and in combat modeling. 

Normal Distribution 

The occurrence of many real-world phenomena can be conveniently ex- 
plained by the normal distribution, also called the Gaussian distribution. In gen- 
eral, the normal distribution can be said to apply to many situations involving 
measurements the frequency of whose values fall within symmetric patterns about 
a central value, with increasing frequency as one tends toward the central value. In 
particular, the sum of several random variables, of any distribution, tends to have 
this property, a fact which is formalized in the central limit theorem. For example, 
consider the values which might be obtained when taking any of the following 
measurements: (a) the heights of all men, age 21 or older, in a large city; (b) the 
grades received on the graduate record examination by all college seniors 
throughout the country; (c) the circumferences of all the full-grown trees in a large 
forest; and (d) the thicknesses of all the metal discs produced by an automatic 
machine in a 24-h period. The normal distribution has two parameters: the mean 
jU and the standard deviation <T. The pdf of the normal distribution, which has a 
distinctive bell shape, is 



f(x) 



1 







1 

2 







(A. 19) 



There is no closed form for the cdf A normal distribution with 
parameters jU and cr is denoted by N {jii, cr). 
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Standard Normal Distribution 

Any normal distribution can be standardized in order to use tabulated inte- 
grals for the special case where n = Q and a= The standardized normal distribu- 
tion is denoted by (0,1). It is common to denote the standard normal random 
variable by Z. The pdf of Z is 



m= 






exp 



/ 2 ^ 

Z 



, —oo<Z<°o. 



(A.20) 



The cdf of the standard normal distribution 



IS 



denoted by 









exp 



-—\du and is widely tabulated. In ExceF"^, the function is 



called NORMSDIST(). The transformation from a general normal distribution to a 

( X — U. 

standard one is straightforward: F{x) = O 

V <7 

Example 19: The daily number of mortar shells launched toward the headquarters 
compound is normally distributed with mean 20 and standard deviation 4. The 
probability that on any given day a mortar attack comprises more than 23 shells is 

P(X>23) = l-P(X<23) = l-of^^-^j = l-<I)f ^1 = 1-0.77 = 0.23. 



Bivariate Normal Distribution 

A random variable may have more than one dimension. For example, the 
point of impact of a bomb relative to a target positioned at the coordinates 
{jU^,jUy) is a bivariate random variable (X,Y), where X is the latitudinal error and 
Y is the longitudinal error with respect to the target. The latitudinal error can be 
negative (deflection to the left) or positive (deflection to the right) and the same is 
tme for the longitudinal error where “short” is in the negative direction with re- 
spect to the target and “long” is in the positive direction. If the error in each di- 
mension is normally distributed with means 0 and standard deviations 
<7^ and (Ty for X and Y, respectively, and if the correlation between the two errors 

is p , then the pdf of the point of impact is, letting K = 1 — == 

^2. 7T (T yy (7 Y ~ P 



f{x,y) = Kexp 



1 



2(1- p^) 



X 



-2p 



x-Px 



y~Fy 
V 7 



y-Py 






(A.21) 
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If the two random variables are independent then p = Q, and 



f{x,y) = 



1 



2 7T (T ^ (y Y 



-exp 






' X 



y-i^Y 



(A.22) 



An important result in this eontext is as follows: if the point of impact has 
bivariate normal distribution with means (^^,/t^)and standard deviation 
(<Tj,,<Ty) and the two errors are independent, then the probability that the bomb 
lands within the ellipse centered at , /tj, ) with latitudinal semi-axis 

— In particular, if 



length ro^ and longitudinal semi-axis length rcr^ is 1 - exp 
O jY = Oy = O then the pdf of the (Euclidean) distance of the point of impact from 
the target located at (^^,/tj,) is F(r) = l-exp 
Rayleigh distribution. 



f _^2 

— ^ I . F{r) is called the 



lo" 



A.9 Stochastic Processes 

A stochastic process (sometimes called random process) is an indexed collec- 
tion or sequence of random variables. A stochastic process is denoted as 
{X{t),te T} where T is called an index set. The most natural way to view T as 
time, but T could also represent other index sets such as spatial coordinates. The 
index set can be discrete, i.e., T = {0,1,2...} or continuous, T = [0,°°). The state 
space of a stochastic process is the set of all possible values of {X{t),te T). 



Example 20: The number of aircraft in a squadron that are mission ready on any 
given day is a stochastic process. The index set is discrete (days) and the state 
space is (0,1,. . .,«}, where n is the number of aircraft in the squadron. 

We will focus on a special type of stochastic processes, those which possess 
the Markov property. But before we define the Markov property and present its 
applications, we first present a simple and widely used special case, the Poisson 
process. 

A. 10 Poisson Process 

The Poisson process counts events occurring in time and is characterized by a 
parameter A>0, which is the rate of arrival of events (see Poisson distribution in 
Section A. 7). The random variable in that process, usually denoted hy N{i), counts 
the number of arrivals up to time t. Note that, as a function of time, N(t) is an inte- 
ger monotone non-decreasing step function; it has a jump whenever an event oc- 
curs. There are three assumptions that underlie the Poisson process: (1) the arrival 
of two or more events simultaneously is unlikely, there is always a time interval 
At > 0 between two consecutive events; (2) for any sequence of points in time 
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0 ,^,,...,^,, the process increments N(tj)- N( 0 ),N(t 2 )- N(t^),...,N(t^)- 

' V ' ^ V ' 

Number of arrivals Number of arrivals 
in the period (0,/| ] in the period (?| ,^2 ] 

are independent random variables, that is, the increments of the process are inde- 
pendent random variables; and (3) the probability distribution of 
N{t + s)-N{t) depends only on s (not on t), that is, the increments are stationary 
random variables. These three assumptions determine the probability distribution 
ofiV(t): 

= . (A.23) 

n\ 

The inter-arrival times in the Poisson process are exponentially distributed ran- 
dom variables, each with pdf / (t) = Ae~^‘ . 

Suppose that each event that arrives in a Poisson process is one of k types. 
The probability that an event is of type i, i = I,..., k, is q^. Then the arrival process 
of events of type / is also a Poisson process with arrival rate q^A. This property is 
sometimes called thinning of a Poisson process. 



Example 21: The arrival of soldiers to the base health clinic follows a Poisson distri- 
bution with rate of 10 patients an hour. Thirty percent of the patients are female. The 
probability that during a 2-h period at most five females will show up at the clinic is 



P(iV(2)<5) 



0! 1! ' 5! 



0.45. 



The Poisson process described above is homogeneous in the sense that the ar- 
rival rate A remains constant throughout the process. It does not depend on the 
specific time at which this rate is observed. A more general process, called non- 
homogeneous Poisson process, relaxes this assumption; the arrival rate A{t) is a 



non-negative function of time. In that case (A.2I) becomes 



P(Nit) = n) 




(A.24) 



A.1 1 Discrete-Time Markov Chain 

Probably one of the most applied stochastic processes in practice is Markov 
chain. A discrete-time Markov chain is defined by the states of the stochastic 
process, the discrete steps or stages in which the states of the process are ob- 
served, the initial probabilities of the states, and the transition probabilities that 
represent the likelihood of moving from one state to another in one step. Formally, 
a stochastic process (X„, « = 0,1,2,...} is said to be a Markov chain if for all states 
...,/„ and j, the probability that the next state is j, given a complete history of 
previous states, depends only on the most recent state: 
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II 

o 

2^ 

II 

a 




■ = P. 






I future 


past 


present 




future 


present \ 



. (A.25) 



In other words, the future is conditionally independent of the past, given the pre- 
sent. A Markov chain is said to have stationary transition probabilities if 

Pi^n+\ - j\^n - 0 = Pij ~ independent of the time step n. We will consider only 

stationary Markov chains. Let P denote the transition matrix of a Markov chain 
with m states as follows: 

Pn Pn • A™ 



P = 



(A.26) 



Pna Pn.2 



P 



mm 



The three properties of P are (1) P is a square matrix; (2) the sum of each 

row is 1 ; and (3) all entries in P are numbers between 0 and 1 . 

Given a vector ;r(0) = (;Tj(0),...,.;r„,(0)), where 

(0) = P(the process is initially at state /), the probabilities of the various states 

m 

at step 1 are given by ;r(l) = .;r(0)P , that is, k,{\) = -{(Pjp j,. Similarly, the 

y=i 

probabilities of the states at the nth step are given by 
K{n) = 7i:{n — 1)P = K{n - 2)P^ =■■■ = n{G)P" . It follows that an entry (n) in the 

matrix P" is the probability that the system moves from state i to state j in n steps. 

Example 22: A system can be in one of three states: good (G), bad {B), and down 
(D). Let denote the state of the system at the nth inspection. Thus, the state 
space is {G, B, D) and the steps are inspection instances. Initially the system is in 
state G and the transition matrix is 



G 


00 

o 


0.1 


o.r 


P = B 


0.2 


0.2 


0.6 


D 


.0.4 


0.5 


0.1, 



For example, if the system is in good condition, then the probability it is go- 
ing to be in bad condition at the time of the next inspection is 0.1. Similarly, if the 
system is currently down, the probability that in the next inspection it is fully re- 
paired and is back in good condition is 0.4. With probability 0.1 it remains down. 
Since it is given that initially the system is in good condition it follows that 
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k{Q) = (1,0,0). The probability vector of the state of the system at the second in- 



spection is 





"0.8 


0.1 


0.1" 


;r(2) = (1,0,0) 


0.2 


0.2 


0.6 




,0.4 


0.5 


0.1, 



(0.7,0.15,0.15). 



Thus, the probability that the system is in good condition two steps (inspections) 
from now is 0.7, while the probability it is down is 0.15. 

A.12 Continuous-Time Markov Chain 

Recall that in a discrete-time Markov chain the process is observed at fixed 
points in time called steps or stages. But what happens in between two consecutive 
steps? What would happen if instead of observing the system every hour we will 
observe it every half hour? 10 min? 1 s? 

As the name implies, in a continuous time Markov chain (CTMC) the proc- 
ess is observed continuously. Instead of considering the discrete-time stochas- 
tic process = 0,1,2,...} we consider now its continuous counterpart, 

that is, {X^ , t > 0} . In that case the Markov property becomes 
P[X{t + s) = j I path of the process in [0,t), X{t) = i] = 

P{X{t + s) = j\ X{t) = i\. The process has stationary transition probabilities if 
P{X{t -h ^) = j\ X{t) = i] = p^j (s) is independent of t. 

We have already seen one CTMC - the Poisson process N{i). Here we have 
P{A(t-i-5) = j\N{t) = {\ = P{N{s) = 7 -i),as given by (A.23). 

Any CTMC comprises a series of jumps that occur at certain points in time. 
Between any two jumps, say from state i to state j, the process remains unchanged 
at state i. The length of time the process spends at any given state i is called the 
sojourn time in state i. The Markov property implies that the sojourn times are in- 
dependent random variables having exponential distributions. The mean sojourn 
time generally depends on the state the process is in, although this is not the case 
for a Poisson process where we already know (Section A. 10) that the mean so- 
journ time is the same for all states and is equal to 1//1. The successive states vis- 
ited in a CTMC form a discrete-time (event-driven) Markov chain where a step 
represents a transition from one state to another. Thus, a CTMC is completely de- 
fined by a state transition matrix = {k^j) that specifies the transition probabilities 

from one state to another, and the mean sojourn time 1 / y in each state i. 

Example 23: In a Poisson process v, = X for all states i, and = 1 if j = i + l, and 
ky = 0 otherwise. 
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Example 24: A machine is subject to failure and repair. Repair times are inde- 
pendent identically distributed exponential random variables with mean Suc- 
cessive up times are independent identically distributed exponential random vari- 
ables with mean l/A Here = X and The state transition matrix is 

2x2 where = 1 and = 0. 

Example 25: Consider a duel between two combatants B and R. The time it takes 
B to kill R is exponentially distributed random variable with parameter h. The time 
it takes R to kill B is exponentially distributed random variable with parameter r. 
The states of this process are (1,1) (combat in progress), (1,0) {B wins), and (0,1) 



^(o,i),(o,i) = ^( 1 , 0 ), ( 1 , 0 ) = 1 kj ^ 0 for all other ij (see race of exponentials in the 
exponential distribution subsection of Section A.8). 

An important class of CTMCs are absorbing chains where some states are ab- 
sorbing; once the process enters one of these states it stays there forever. The 
other, non-absorbing, states are called transient states. The Markov transition ma- 
trix K of such a CTMC is of the form 



where/ is a unit matrix that corresponds to the absorbing states, Q is a square 
submatrix containing the transition probabilities among transient states, and R is a 
submatrix that represents transitions from a transient state to an absorbing state. 
Such a matrix has some very useful properties. First note that in the long run the 
process will end up in one of the absorbing states. The limiting probabilities of the 
various absorbing states are given in the matrix (/-(/) '/? . The rows of this ma- 
trix correspond to the transient states and its columns represent the absorbing 
states, thus the i,j entry of {I -Qf^R is the probability that the Markov process 
ends up in (absorbing) state j given it starts in (transient) state i. Second, the ex- 
pected number of visits to a certain transient state /, given that the process starts in 
(transient) state k is the k, I entry of the matrix (/ - (2) ' . 



{R wins). Here Vjj „ = b + r, v„ ^ = 0 , A:, 



’ AU),(1,0) 



bjib + r), =r/(b-fr). 




(A.27) 



Example 26: Consider Example 25. The duel between B and R is represented by 
the matrix 
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1 0 

0 1 

b r 



b+r b+t 



Here, ; = 1 is the absorbing state (1,0), i = 2 is the absorbing state (0,1), and i = 3 

b r 



is the (only) transient state (1,1). Also R = 



probabilities are (1-0) ' 



b + r b + r 



and Q = 0. The limiting 



b r 




b r 


_b + r ’ b + r _ 




_b + r ’ b + r _ 



and the expected number 



of visits to a bansient state is 1 - the initial state of the duel. 
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The main reason for modeling combat is to improve decision making, 
whether the decisions concern procurement, operations, tactics, or logistics. In cer- 
tain limited circumstances, it is possible to find the best possible decision. Meth- 
ods for doing this with the aid of a computer are the subject of this section. Com- 
putations will be illustrated using the ExceF'^ workbook AppendixB.xls. 

This appendix is a brief introduction to a topic about which there is much 
more to be said. For further details and more examples, see Winston (1994) or 
some other book that introduces operations research. 

B.1 Introduction 

John von Neumann and Morgenstem (1944) prove that “rational” decision 
makers will make decisions that maximize the decision maker’s expected “utility”, 
a scalar measure of desirability. The widespread use of computers to help make 
decisions through optimization is based on this foundation. In practice we may 
minimize, rather than maximize, since sometimes it is more convenient to deal 
with disutility (cost), rather than utility. 

If X is the decision to be made and t/(x) is the utility of the decision, the asso- 
ciated mathematical problem is to maximize C/(x) subject to x being feasible. The 
decision symbol is in bold type to emphasize that it may be a vector rather than a 
simple number. The utility function itself must be a scalar. If the number of possi- 
ble decisions is not too large, it may be possible to simply examine them all and 
select the one with the largest utility. In fact, the “sort” function in any spreadsheet 
might be considered the most elementary of its optimization functions, since sort- 
ing on utility will leave the best decision exposed. This might be called brute force 
optimization - simply list all the possibilities and choose the best. 

However, there are many problems where it is not attractive or even impossi- 
ble to explicitly list all possible decisions. In ExceF'^, this is handled by Solver, a 
tool that brings up a dialog box where the user is invited to select one utility cell to 
optimize (you must be explicit about whether you want to maximize or minimize), 
and one or more “adjustable” cells that Solver will adjust to optimize utility. These 
adjustable cells are Solver’s name for x, the decision, and may be subjected to 
various feasibility constraints. If you do not see Solver on the Tools menu, check 
the box on the list of Add-ins so that you will have it available. 

Example 1: Consider the maximization of the flmction/(v) = sin(x)/x, subject to the 
constraint that x must be at least 3. On sheet “Hill Climb” of AppendixB.xls, you can 
see a graph of that function, from which it is obvious that the maximum is at ap- 
proximately X = 8 (better values are available for small x, but x is required to be at 
least 3). Solver is set up to find the maximum of that function, which is in the pink 
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cell (F2), by adjusting the green cell (x, or G2). If you put any number near 8 in the 
green cell and mn Solver, it will adjust the green cell to 7.7, which is optimal. 

Note that Solver is careful in describing its answer: 

Solver has found a solution. All constraints and necessary conditions are sat- 
isfied. 

That is not quite the same thing as saying that the solution is guaranteed to be op- 
timal. In fact, if you start Solver with a number like 4 in the green cell, it will 
make the same speech after proposing that the solution is 3. Solver is basically hill 
climbing, by which we mean that it senses the slope of the function, and continu- 
ally moves uphill until it gets to the top (which it recognizes because the function 
is flat there) or can go no further on account of a constraint. While this will lead to 
an optimum in the sense that small changes to x will produce no improvement to 
fx), the final point may still not be optimal in a global sense. 

Of course, we would prefer it if computed solutions could actually be guaran- 
teed to be globally optimal. The principal class of problems where this is actually 
tme is considered next. 

B.2 Linear Programs 

In a linear program (LP), x=(xi,. . ,,x„) is a vector with n components, the oh- 

n 

jective function is (7 (x) = , where c is an arbitrary vector of constants, 

;=1 

and the constraints on x take the form that other linear combinations cannot ex- 
ceed given limits. If there are m such constraints, they can all be compactly repre- 
sented by letting x be a column vector and requiring Ax <b, where A is a mxn 
matrix and b is a constant column vector with m components. The objective can 
be either maximized or minimized, and the constraints can also include equalities 
or > inequalities. 

Linear programs with millions of variables and constraints are solvable with 
modern computer codes, and the results are (or should be) guaranteed to be opti- 
mal, with three caveats. One caveat is that there may be no feasible solution, in 
which case there is of course no optimal solution. Another is that the constraints 
may not limit the solution, so that C/(x) is unbounded. For example, the problem of 
maximizing x subject to v > 0 is a linear program with an unbounded solution. 
Analysts do not usually formulate unbounded or infeasible LPs deliberately, but 
computer codes like Solver must be prepared for anything. The last caveat is that 
the LP may simply be too large for the computer code to handle. Subject to these 
three caveats, the optimal solution of any LP is computable. 

Solver in the 2003 version of ExceF'^ can handle only 200 adjustable cells 
(« < 200), but that is sufficient for tutorial purposes, as well as for many applica- 
tions. 

Example 2: This is a surveillance problem where there are three platform types 
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available for maintaining surveillance over four areas. The number of platforms of 
each type i is an input h,. The platforms have different efficiencies in the different 
areas. Aircraft, for example, are most efficient at providing surveillance in areas 
that are near their bases. One platform of type i, if assigned to area j, will find any 
target in that area with rate a,y The 12 inputs Uy reflect the relative efficiencies of 
the three platform types in the four areas. The quantification of these inputs might 
involve the ideas of Chapter 7, especially Section 7.3.2, but here we take them as 
given. Let Xy be the number of platforms of type i assigned to patrol in area j, and 

let y. = S QyX^j be the total detection rate in area j. This is a linear equation that 

i 

relates the decision variables to a measure of effectiveness in each area. The object 
is to assign platforms so that the worst-case total detection rate is as large as pos- 
sible. This is most easily accomplished by introducing one more variable, the ob- 
jective V, while requiring that y, be at least v in all four areas. Once the adjustable 
cells (x,v) are optimized, the detection rate will be at least v in each of the four ar- 
eas. Another way of putting this is that the mean time to detection will be at most 
1/v. In summary, the LP is to maximize v, subject to the constraints 

^ayX^.-V>0 for ally, ^ X- < Zt. for all i, and Xy> 0 for all i and j. No ex- 

i j 

plicit reference to y, is necessary, so the LP has a total of 13 variables (« = 13). 
The constraints Xy> 0 are necessary; otherwise. Solver will try to make some of 
them negative. Counting these 12 constraints, there are a total of 19. See sheet 
“MaxRate” of AppendixB.xls for a solution. The solution will be globally optimal, 
regardless of the starting point for Solver, because the optimization problem is an 
LP. Suggested exercises based on this example are 

1) Examine the spreadsheet to see how the problem is set up for Solver. Solver 
does not perniit linear expressions in its dialog box, so all such computa- 
tions must be in the spreadsheet. Solver does not permit variables on the 
right-hand side of inequalities, so all variables must appear on the left, as in 
the formulation above. Solver does permit arrays on the left-hand side, so 
only one line in the dialog box is needed for all of the constraints x,y>0. 

2) Change the input data a and/or b and run Solver to see the impact on x and 

V. 

3) Delete the constraints Xy>0 to see what happens if you make a mistake like 

that. 

B.3 Integer Constraints 

The solution of Example 2 usually involves fractional values for some of the 
Xy variables. This might or might not be a problem. It might be possible to achieve 
the equivalent of assigning (say) 2.5 aircraft to a particular area by assigning 2 on 
some days and 3 on others. In that case the solution of the LP is exactly what is 
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needed. If this kind of time averaging is not possible, however, some variables 
will have to be restricted to integer values. The result is known as an integer pro- 
gram if all of the variables are so restricted, or as a mixed integer program (MIP) 
in the more common case where only some of them are. In the case of the surveil- 
lance example, we would restrict x to be integer, but not v, so we would have a 
MIP. In either case, the resulting optimization problem is of a llmdamentally more 
difficult type than is an ordinary LP. 

The added computational difficulty need not necessarily concern the user. In 
using Solver, it is simply a matter of adding a constraint that certain variables 
must be integers. In the dialog box, add a constraint where all of the subject vari- 
ables are named, and select “integer” from the dropdown relationship menu. The 
main import of the computational difficulty is that the user should have a height- 
ened sense of apprehension about what will happen when the “solve” button is 
pushed. There are examples where no solution at all is returned within reasonable 
time, and also examples where an incorrect solution is returned. 

Example 3: The knapsack problem is a famous integer programming problem 
where a backpacker can only carry b pounds, and must decide which items to put 
in his knapsack. Each item has a weight and a value, and any number of items of 
each type can be carried as long as the knapsack does not become heavier than h. 
The value of a collection of items is by assumption additive (which means that the 
naming of this kind of problem was unfortunate, since the value of three sleeping 
bags is surely not three times the value of one). The knapsack problem is trivial 
without the integer restriction - the solution is to carry as many as possible of the 
item with the greatest value/weight ratio. The integer restriction puts the problem 
into a class that is known to be fundamentally difficult. As of this writing. Solver 
gets the wrong solution to at least one knapsack problem, the one shown on sheet 
“Knapsack” of AppendixB.xls. The optimal solution is known to be 
X = (32,2,1,0,0,0,0), with a total value of 19,979. The Solver that comes with Ex- 
ceF'^ 2003 gets a score of only 19,972, even if the optimal solution is provided as 
a starting point. 

Example 3 is not meant to convince you that using Solver is a bad idea; 
19,972 is not a bad approximation to 19,979, after all, and Solver usually gets ex- 
actly the right answer. Nonetheless, bear in mind that, just as there are some re- 
markably complex looking problems are actually simple (any LP), there are also 
some remarkably simple sounding problems that are actually complex enough to 
justily skepticism about “optimal” solutions. 

Integer variables are sometimes restricted to take on only two values, 0 and 1, 
in which case they are called “binary” variables. Binary variables can be used to 
include certain logical relationships in a problem that would otherwise be an LP. 
Suppose that x represents the level of production of some commodity, and that the 
objective function includes a term -kx because each unit of x costs k to produce. 
The trouble is that a factory is required if the level of production is to be positive, 
but currently there is no factory. Introduce a binary variable z that indi- 
cates whether a factory is to be built, together with the temi -Kz in the objective 
function, where K is the cost of a factory. The logical idea that nothing can be 
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produced without a factory can be included by adding the constraint x<Mz, where 
M is a factory’s production capability. If z is 0, then x cannot be positive. If z is 1, 
then X can be as large as M. This method of including the idea that one system is 
required for the employment of another has wide applicability. It is employed in 
Section 10.2.2, for example, to enforce the idea that UAVs cannot fly without a 
ground control unit. There are special algorithmic techniques for handling binary 
variables, so it is wise to identify them as such (not as integer variables) when us- 
ing most optimization packages. In Solver, the same pulldown list that includes 
“inf’ also has “binary” on it. Choose the latter, when appropriate. 

B.4 Nonlinear Programs 

If any of the functions involved in expressing the decision problem are 
nonlinear functions of x, the result is called a nonlinear program (NLP). Example 
1 is a nonlinear program that illustrates the tendency of nonlinear solvers to find a 
locally optimal solution, but not necessarily a globally optimal one. Another illu- 
minating exercise would be to try maximizing x^ with Solver, starting from an ini- 
tial guess of 0. Nothing happens in the Solver that comes with ExceF'^ 2003, the 
difficulty being that the function is flat at the origin (with any other starting point. 
Solver will correctly report that the function is unbounded). 

Again, these cautionary tales are not meant to inhibit your use of Solver or 
other optimization software, but only to instill the habit of skepticism. Nonlinear 
problems are solved in several places in this book, with results being presented as 
“optimal” after only reasonableness tests on the part of the authors. Most of the 
time, in most reasonably posed problems. Solver will get the optimal solution. Be- 
sides, employing Solver is fail-safe - if you do not like Solver’s solution, you can 
always reject it in favor of the starting point you provided in the first place. 

It is worth some effort to try to express an NLP as an LP, if possible. An ex- 
ample of that might be Example 2, where yj was defined to be the total search rate 
in area j. We might maximize the function mm(yi,y 2 ,y 3 ,y 4 ), the smallest of the four 
search rates. This would cause us to save one variable (v) at the cost of making the 
objective function nonlinear, since min() is a nonlinear function. This is a no- 
brainer - there is great value in preserving the LP, and only a small sacrifice in in- 
troducing one more variable. The linear formulation given in Example 2 is best. 
Another example is in Section 10.2.2, where a nonlinear problem is converted into 
a linear problem by taking logarithms. 

B.5 Dynamic Programming (DP) 

Solver will not solve dynamic programs, nor will most other optimization 
packages. The reason is that DP problems do not take a standard form, but instead 
exploit a principle called the “principle of optimality.” This principle implies that 
the solution of a large problem may automatically imply the solution of a host of 
smaller problems. For example, suppose that the shortest route from New York to 
Los Angeles goes through Pittsburgh. Then it follows that the part of it going to 
Pittsburgh is also the shortest route from New York to Pittsburgh. This compara- 
tively innocent observation actually has considerable power, to the point where it 
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underlies not only shortest route computations but also computations in similar 
circumstances where a convenient “state” can be identified. The state of a shortest 
route problem is the city where one currently is. The state of the ABM problems 
considered in Chapter 4 is the number of ABMs remaining to allocate to whatever 
number of attackers survive the previous allocations. 

The basic DP method is to embed the given problem in a class of smaller 
problems interconnected by a recursive equation. At the end of computations, one 
has typically solved all smaller problems, as well as the given problem. The solu- 
tion method does not take well to direct implementation in a spreadsheet, but a 
programming language such as VBA (used in Chapter 4) is a sufficient extension. 
The inventor of DP is Richard Bellman, whose book with Dreyfus (1962) is still 
worth reading. Alternatively, see Winston (1994). 




Appendix C 

Monte Carlo Simulation 



Monte Carlo simulation is a method for investigating stochastic models that 
involves the use of random numbers, typically as generated by a computer using a 
random number generator. These random numbers can be used to imitate the kind 
of unpredictability sometimes exhibited by the real world. Through repetition, 
random variables of interest can often be quantified, even when direct probabilis- 
tic analysis fails. The technique is an old one, but did not become practical for ac- 
curate computations until the advent of the digital computer. It was used by E. L. 
de Forest in the Manhattan Project of World War II, at which time Stanley Ulam 
and John von Neumann named the technique. 

C.1 The General Idea 

Suppose that random variable X is of interest, but that the probability distribu- 
tion of X is unknown. One method of describing X is to repeatedly perform the 
fundamental experiment on which X is based, observe Tj in the ;th independent it- 
eration, and then use statistical methods to infer something about X from the ran- 
dom sample X\, X„.lf n is large, we can generally expect our inferences to be 
accurate. The random sample might be obtained from observations about the real 
world, but in Monte Carlo simulation it is generated artificially, nearly always us- 
ing a computer. 

It may seem remarkable that a random sample X\, ...,X„ could be generated 
under the premise that the probability distribution of X is unknown, but exactly 
that situation is remarkably common. For example, let X be the sum of two six- 
sided dice, as in the game of Craps, and consider the event that X is 11. What is 
P{X= 11)? Since X is not a standard kind of random variable, we cannot look the 
answer up in a table. However, samples oiX are easy to generate by simply rolling 
two dice and adding up the spots. “Rolling a die” is easy on a computer, so we can 
perform the experiment (say) 10,000 times and then estimate P{X=\\) to be the 
fraction of times when 11. The experimental fraction will not be exactly cor- 
rect, but it will be close to the exact value of 1/18. 

If you already know that the exact answer is 1/18, you might be wondering 
why anyone would be foolish enough to go to all that trouble to be slightly wrong 
about the chances of getting 1 1 in Craps. People who are skilled at probability cal- 
culations often have that reaction to Monte Carlo simulation. However, while it is 
certainly true that there is no point in simulating something to which you already 
know the answer, it is also trae that there is a distressing abundance of problems 
for which exact answers are lacking, even for skilled probabilists. Let X be the 
sum of 10 dice. Do you know P(30<T<50)? To calculate that number exactly, you 
would first calculate P(X= 30), and then add that to 20 other similar numbers. 
To calculate P{X=3G), you would need to count the number of ways in which 10 
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six-sided dice could sum to 30. All of this could certainly be done, but the process 
is time consuming and error prone. Why not just roll 10 dice to determine X, and 
repeat the experiment often enough to be confident in the experimental fraction? 
The replications can be performed very rapidly on a computer. 

For a more important example, consider nuclear fission. When a uranium 
atom releases a neutron, the neutron will travel in a uniformly random direction 
until it encounters something. It might encounter another uranium atom, thus gen- 
erating more neutrons, or it might be absorbed by some other kind of atom. The 
process might generate so many neutrons that it would become unstable, or it 
might not. To estimate the chances of instability, the Manhattan scientists em- 
ployed tables of random numbers to determine the details of individual interac- 
tions. Modem computers simply make the Monte Carlo process more efficient. 

C.2 Random Number Generation 

Computers are designed to be absolutely predictable in their computations, so 
finding a satisfactory way to generate “random” numbers is a challenging task. 
One method is to consult the system clock at the exact time when the number is 
needed, basing the result on the low-level digits. This method is seldom used be- 
cause it is not reproducible - even though we wish to generate “random” numbers, 
we also wish to be able to reproduce our experiments for debugging purposes. The 
usual compromise is to generate “pseudorandom” numbers in a sequence Vi, 
X 2 , ..., where the generation of x„+i depends on x„. For example, one famous 
method is to let x„+\ be the remainder after dividing ax„ by m, where a and m are 
constant positive integers. If m is a prime number, then x„+i will always be an in- 
teger larger than 0 and smaller than m. If a and m are carefully chosen, the se- 
quence will pass most statistical tests for being a sequence of independent random 
integers uniformly distributed over that interval, but will still be reproducible if 
necessary because the entire sequence depends on the “seed”xi. One particularly 
apt choice for 32-bit computers is a = 7^ and m = 2^^ - \ , which Park and Miller 
(1988) refer to as the “minimal standard.” 

Most programming languages contain a function for generating a sequence of 
random numbers uniformly distributed over the interval (0,1). One way of doing 
this would be to divide the minimal standard sequence by m, but the programmer 
need not be concerned with how the sequence is actually generated - whenever a 
random number is needed, it suffices to call the function. In ExceF'^, this function 
is called RAND(). RAND() is a “volatile” function; if you write “=RAND()” in 
two different cells, the numbers displayed will be different in the two cells, and 
furthermore will change every time you make any change in the spreadsheet. Fur- 
thermore the effects of “=2*RAND()” are different from “=RAND()-l-RAND(),” 
since the second expression calls the RAND() function twice while the first calls it 
only once. 

Given the availability of random numbers uniformly distributed in the interval 
(0,1), it turns out not to be difficult to generate random numbers having other dis- 
tributions. The theoretical reason is that, if FQ} is the cumulative distribution func- 
tion of random variable X, then F{X) is a uniform random variable. Since we can 




Appendix C 



265 



generate uniform random variables, we can also generate random variables from 
F() by employing its inverse function. In ExceF'^, for example, the expression 
“=NORMINV(RAND(), Al, A2)” will produce a normal random number with 
mean Al and standard deviation A2. There are a variety of commercial “addins” 
that provide functions for sampling from practically any standard distribution. 

Once we are able to generate random numbers with arbitrary distributions, we 
can employ them to simulate a variety of interesting problems that would other- 
wise be difficult to analyze. 

C.3 Statistical Issues and SimSheet.xls 

Monte Carlo simulation works by repeating the same experiment over and 
over, collecting and eventually displaying statistics about the outcome. Suppose 
that the outcome is a number Y, a random variable that summarizes the experi- 
ment. Natural statistics to report would be the sample mean, the standard error of 
estimating the mean, and perhaps a histogram showing the sample distribution. 
SimSheet.xls automates this process. That workbook contains an input cell Al that 
refers to whatever cell defines Y, and another input cell that specifies the number 
of replications desired. After adjusting these two cells, the user presses the “Simu- 
late” command button to repeat the experiment the desired number of times. Out- 
puts include those suggested above. SimSheet.xls can be employed with any other 
workbook, as long as both workbooks are open at the same time. On SimSheet, 
type “=” in cell Al. Then click the cell in the other workbook that you want to 
simulate, and press the green checkmark. 

The number of replications can be selected to make the standard error “suffi- 
ciently small,” which of course depends on the application. A few thousand repli- 
cations usually suffice. Confidence intervals can be constructed by the usual statis- 
tical techniques. A confidence interval for E{Y), for example, can be based on the 
sample mean and standard error reported by SimSheet.xls. 

C.4 Examples and Exercises 

Example 1: Consider the probability of getting 1 1 in Craps. A Monte Carlo simu- 
lation is on sheet “Craps” of AppendixC.xls. A die roll is put into each of two 
cells, and the sum is put into a third. Using SimSheet.xls, select (say) 10,000 repli- 
cations, press the simulate button, and examine the resulting histogram to deter- 
mine the fraction of rolls that are actually 11. The answer is unpredictable because 
RAND() is a volatile function, but should be close to 1/18. Alternatively, add the 
fonuula “=IF(C2=1 1,1,0)” to cell D2, which will put a 1 or a 0 into cell D2 de- 
pending on whether the total is 1 1 or not. The average of this quantity, which is 
automatically reported by SimSheet.xls if you link it to cell D2, is the fraction of 
replications where an 1 1 appears. 

Example 2: The Battle of 73 Easting was a battle in the first Gulf War where 
23 red tanks were killed by 12 blue tanks, with no losses to blue. Particularly re- 
markable was that seven of these kills were made by a single blue tank. Or is it 
remarkable? We would like to know the probability that this might happen by 
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chance, rather than through any special skill or circumstance affecting the record 
setting blue tank. To be precise, what is the probability P that the highest scoring 
blue tank would kill seven or more red tanks even if each red kill were made by a 
randomly selected blue tank? Calculation of P is difficult, but simulation is not. 
Sheet “73Easting” of AppendixC.xls does this. Each row of that sheet determines 
the fate of one red tank by using a single random number (using a new random 
number for every cell in the row would not give the desired result) to select its kil- 
ler. The Sum() and Max() functions of ExceF"^ are then sufficient to determine the 
number of red tanks killed by the highest scoring blue tank, recorded in cell N25. 
Cell N25 can then be linked to SimSheet.xls. Probability P turns out to be ap- 
proximately 0.02, so the chances of this lopsided tally happening “by chance” are 
small enough that some other explanation should be sought. 

There are other examples in the text. Example 3 of Section 6.3 involves a 
Monte Carlo simulation of a Lanchester battle. Chapter 8.xls includes separate 
sheets for the same minefield planning problem solved analytically and by Monte 
Carlo simulation - see Section 8.3 for a detailed description. 

C.5 Further Reading 

The above brief introduction is sufficient for this book, but there is much 
more to be said about simulation. Books on operations research such as Taha 
(2007) or Winston (1994) often include a section on Monte Carlo simulation, in- 
cluding methods for generating random samples from various distributions. In ad- 
dition to addins for ExceF'^, there are special-purpose simulation software pack- 
ages available that aid visualization and the formulation of complicated problems. 
Swain (2005) is a survey of over 40 different packages available. 
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